FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0777, 331 aa
1>>>pF1KE0777 331 - 331 aa - 331 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.3070+/-0.00114; mu= 7.9307+/- 0.068
mean_var=197.7966+/-39.962, 0's: 0 Z-trim(110.0): 88 B-trim: 0 in 0/50
Lambda= 0.091194
statistics sampled from 11189 (11269) to 11189 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.678), E-opt: 0.2 (0.346), width: 16
Scan time: 2.660
The best scores are: opt bits E(32554)
CCDS7279.1 HNRNPH3 gene_id:3189|Hs108|chr10 ( 331) 2362 323.2 1.8e-88
CCDS7278.1 HNRNPH3 gene_id:3189|Hs108|chr10 ( 346) 1490 208.5 6.3e-54
CCDS14485.1 HNRNPH2 gene_id:3188|Hs108|chrX ( 449) 762 112.9 5.1e-25
CCDS4446.1 HNRNPH1 gene_id:3187|Hs108|chr5 ( 449) 756 112.1 8.7e-25
CCDS7204.1 HNRNPF gene_id:3185|Hs108|chr10 ( 415) 653 98.5 1e-20
>>CCDS7279.1 HNRNPH3 gene_id:3189|Hs108|chr10 (331 aa)
initn: 2362 init1: 2362 opt: 2362 Z-score: 1702.0 bits: 323.2 E(32554): 1.8e-88
Smith-Waterman score: 2362; 100.0% identity (100.0% similar) in 331 aa overlap (1-331:1-331)
10 20 30 40 50 60
pF1KE0 MDWVMKHNGPNDASDGTVRLRGLPFGCSKEEIVQFFQGLEIVPNGITLTMDYQGRSTGEA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 MDWVMKHNGPNDASDGTVRLRGLPFGCSKEEIVQFFQGLEIVPNGITLTMDYQGRSTGEA
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 FVQFASKEIAENALGKHKERIGHRYIEIFRSSRSEIKGFYDPPRRLLGQRPGPYDRPIGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 FVQFASKEIAENALGKHKERIGHRYIEIFRSSRSEIKGFYDPPRRLLGQRPGPYDRPIGG
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 RGGYYGAGRGSYGGFDDYGGYNNYGYGNDGFDDRMRDGRGMGGHGYGGAGDASSGFHGGH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 RGGYYGAGRGSYGGFDDYGGYNNYGYGNDGFDDRMRDGRGMGGHGYGGAGDASSGFHGGH
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE0 FVHMRGLPFRATENDIANFFSPLNPIRVHIDIGADGRATGEADVEFVTHEDAVAAMSKDK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 FVHMRGLPFRATENDIANFFSPLNPIRVHIDIGADGRATGEADVEFVTHEDAVAAMSKDK
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE0 NNMQHRYIELFLNSTPGGGSGMGGSGMGGYGRDGMDNQGGYGSVGRMGMGNNYSGGYGTP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 NNMQHRYIELFLNSTPGGGSGMGGSGMGGYGRDGMDNQGGYGSVGRMGMGNNYSGGYGTP
250 260 270 280 290 300
310 320 330
pF1KE0 DGLGGYGRGGGGSGGYYGQGGMSGGGWRGMY
:::::::::::::::::::::::::::::::
CCDS72 DGLGGYGRGGGGSGGYYGQGGMSGGGWRGMY
310 320 330
>>CCDS7278.1 HNRNPH3 gene_id:3189|Hs108|chr10 (346 aa)
initn: 1460 init1: 1460 opt: 1490 Z-score: 1081.7 bits: 208.5 E(32554): 6.3e-54
Smith-Waterman score: 2322; 95.7% identity (95.7% similar) in 346 aa overlap (1-331:1-346)
10 20 30 40 50 60
pF1KE0 MDWVMKHNGPNDASDGTVRLRGLPFGCSKEEIVQFFQGLEIVPNGITLTMDYQGRSTGEA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 MDWVMKHNGPNDASDGTVRLRGLPFGCSKEEIVQFFQGLEIVPNGITLTMDYQGRSTGEA
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 FVQFASKEIAENALGKHKERIGHRYIEIFRSSRSEIKGFYDPPRRLLGQRPGPYDRPIGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 FVQFASKEIAENALGKHKERIGHRYIEIFRSSRSEIKGFYDPPRRLLGQRPGPYDRPIGG
70 80 90 100 110 120
130 140 150 160
pF1KE0 RGGYYGAGRGS---------------YGGFDDYGGYNNYGYGNDGFDDRMRDGRGMGGHG
::::::::::: ::::::::::::::::::::::::::::::::::
CCDS72 RGGYYGAGRGSMYDRMRRGGDGYDGGYGGFDDYGGYNNYGYGNDGFDDRMRDGRGMGGHG
130 140 150 160 170 180
170 180 190 200 210 220
pF1KE0 YGGAGDASSGFHGGHFVHMRGLPFRATENDIANFFSPLNPIRVHIDIGADGRATGEADVE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 YGGAGDASSGFHGGHFVHMRGLPFRATENDIANFFSPLNPIRVHIDIGADGRATGEADVE
190 200 210 220 230 240
230 240 250 260 270 280
pF1KE0 FVTHEDAVAAMSKDKNNMQHRYIELFLNSTPGGGSGMGGSGMGGYGRDGMDNQGGYGSVG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 FVTHEDAVAAMSKDKNNMQHRYIELFLNSTPGGGSGMGGSGMGGYGRDGMDNQGGYGSVG
250 260 270 280 290 300
290 300 310 320 330
pF1KE0 RMGMGNNYSGGYGTPDGLGGYGRGGGGSGGYYGQGGMSGGGWRGMY
::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 RMGMGNNYSGGYGTPDGLGGYGRGGGGSGGYYGQGGMSGGGWRGMY
310 320 330 340
>>CCDS14485.1 HNRNPH2 gene_id:3188|Hs108|chrX (449 aa)
initn: 1109 init1: 574 opt: 762 Z-score: 562.7 bits: 112.9 E(32554): 5.1e-25
Smith-Waterman score: 1337; 62.8% identity (78.2% similar) in 349 aa overlap (1-324:93-432)
10 20
pF1KE0 MDWVMKHNGPND---ASDGTVRLRGLPFGC
::::.::.:::. :.:: ::::::::::
CCDS14 SEEEVKLALKKDRETMGHRYVEVFKSNSVEMDWVLKHTGPNSPDTANDGFVRLRGLPFGC
70 80 90 100 110 120
30 40 50 60 70 80
pF1KE0 SKEEIVQFFQGLEIVPNGITLTMDYQGRSTGEAFVQFASKEIAENALGKHKERIGHRYIE
:::::::::.::::::::.:: .:.::::::::::::::.::::.:: ::::::::::::
CCDS14 SKEEIVQFFSGLEIVPNGMTLPVDFQGRSTGEAFVQFASQEIAEKALKKHKERIGHRYIE
130 140 150 160 170 180
90 100 110 120 130
pF1KE0 IFRSSRSEIKGFYDPPRRLLG-QRPGPYDRPIGGRG--------GY----YGAGRGSYGG
::.:::.:.. :::::.:.. ::::::::: .::: :. :: :.:::
CCDS14 IFKSSRAEVRTHYDPPRKLMAMQRPGPYDRPGAGRGYNSIGRGAGFERMRRGAYGGGYGG
190 200 210 220 230 240
140 150 160 170 180 190
pF1KE0 FDDYGGYNN-YGYGNDGFD-DRMRDGRGMGGHGYGGAGDASSGFHG--GHFVHMRGLPFR
.:::::::. ::.:.: : : ::. : :: :..:.:.. :: :::::::.:
CCDS14 YDDYGGYNDGYGFGSDRFGRDLNYCFSGMSDHRYG---DGGSSFQSTTGHCVHMRGLPYR
250 260 270 280 290
200 210 220 230 240 250
pF1KE0 ATENDIANFFSPLNPIRVHIDIGADGRATGEADVEFVTHEDAVAAMSKDKNNMQHRYIEL
:::::: ::::::::.::::.:: :::.::::::::.:::::::::.::: ::::::.::
CCDS14 ATENDIYNFFSPLNPMRVHIEIGPDGRVTGEADVEFATHEDAVAAMAKDKANMQHRYVEL
300 310 320 330 340 350
260 270 280 290 300
pF1KE0 FLNSTPG-GGSGMGGSGMGGYGRDGMDNQGG-YGS--VGRMGMGNNYS-GGYGTPDGLGG
::::: : .:... : . . . .:: ::: .: ::..:. : :: .. . ::
CCDS14 FLNSTAGTSGGAYDHSYVELFLNSTAGASGGAYGSQMMGGMGLSNQSSYGGPASQQLSGG
360 370 380 390 400 410
310 320 330
pF1KE0 YGRGGGGSGGYYGQGGMSGGGWRGMY
:: ::: ::..:::
CCDS14 YG------GGYGGQSSMSGYDQVLQENSSDYQSNLA
420 430 440
>>CCDS4446.1 HNRNPH1 gene_id:3187|Hs108|chr5 (449 aa)
initn: 1213 init1: 579 opt: 756 Z-score: 558.4 bits: 112.1 E(32554): 8.7e-25
Smith-Waterman score: 1329; 62.5% identity (77.3% similar) in 352 aa overlap (1-324:93-432)
10 20
pF1KE0 MDWVMKHNGPND---ASDGTVRLRGLPFGC
::::.::.:::. :.:: ::::::::::
CCDS44 SEDEVKLALKKDRETMGHRYVEVFKSNNVEMDWVLKHTGPNSPDTANDGFVRLRGLPFGC
70 80 90 100 110 120
30 40 50 60 70 80
pF1KE0 SKEEIVQFFQGLEIVPNGITLTMDYQGRSTGEAFVQFASKEIAENALGKHKERIGHRYIE
:::::::::.::::::::::: .:.::::::::::::::.::::.:: ::::::::::::
CCDS44 SKEEIVQFFSGLEIVPNGITLPVDFQGRSTGEAFVQFASQEIAEKALKKHKERIGHRYIE
130 140 150 160 170 180
90 100 110 120 130
pF1KE0 IFRSSRSEIKGFYDPPRRLLG-QRPGPYDRPIGGRG--------GY----YGAGRGSYGG
::.:::.:.. :::::.:.. ::::::::: .::: :. :: :.:::
CCDS44 IFKSSRAEVRTHYDPPRKLMAMQRPGPYDRPGAGRGYNSIGRGAGFERMRRGAYGGGYGG
190 200 210 220 230 240
140 150 160 170 180 190
pF1KE0 FDDYGGYNN-YGYGNDGFD-DRMRDGRGMGGHGYGGAGDASSGFHG--GHFVHMRGLPFR
.:::.:::. ::.:.: : : ::. : :: :..: :.. :: :::::::.:
CCDS44 YDDYNGYNDGYGFGSDRFGRDLNYCFSGMSDHRYG---DGGSTFQSTTGHCVHMRGLPYR
250 260 270 280 290
200 210 220 230 240 250
pF1KE0 ATENDIANFFSPLNPIRVHIDIGADGRATGEADVEFVTHEDAVAAMSKDKNNMQHRYIEL
:::::: ::::::::.::::.:: :::.::::::::.::::::::::::: ::::::.::
CCDS44 ATENDIYNFFSPLNPVRVHIEIGPDGRVTGEADVEFATHEDAVAAMSKDKANMQHRYVEL
300 310 320 330 340 350
260 270 280 290 300
pF1KE0 FLNSTPGGGSGMGGSGMGGYGRDGMDNQGG-----YGS--VGRMGMGNNYS-GGYGTPDG
::::: :.. ::. : . ... .: ::: .: ::..:. : :: .. .
CCDS44 FLNSTAGAS---GGAYEHRYVELFLNSTAGASGGAYGSQMMGGMGLSNQSSYGGPASQQL
360 370 380 390 400 410
310 320 330
pF1KE0 LGGYGRGGGGSGGYYGQGGMSGGGWRGMY
:::: ::: ::..:::
CCDS44 SGGYG------GGYGGQSSMSGYDQVLQENSSDFQSNIA
420 430 440
>>CCDS7204.1 HNRNPF gene_id:3185|Hs108|chr10 (415 aa)
initn: 1054 init1: 498 opt: 653 Z-score: 485.6 bits: 98.5 E(32554): 1e-20
Smith-Waterman score: 1165; 57.3% identity (77.4% similar) in 328 aa overlap (1-306:93-414)
10 20
pF1KE0 MDWVMKHNGPNDA---SDGTVRLRGLPFGC
::::.::.:::.: .:: ::::::::::
CCDS72 SEDDVKMALKKDRESMGHRYIEVFKSHRTEMDWVLKHSGPNSADSANDGFVRLRGLPFGC
70 80 90 100 110 120
30 40 50 60 70 80
pF1KE0 SKEEIVQFFQGLEIVPNGITLTMDYQGRSTGEAFVQFASKEIAENALGKHKERIGHRYIE
.::::::::.::::::::::: .: .:. ::::::::::.:.::.:::::::::::::::
CCDS72 TKEEIVQFFSGLEIVPNGITLPVDPEGKITGEAFVQFASQELAEKALGKHKERIGHRYIE
130 140 150 160 170 180
90 100 110 120 130
pF1KE0 IFRSSRSEIKGFYDPPRRLLG-QRPGPYDRP------IG-----G----RGGYYGAGRGS
.:.::. :.... ::: .... ::::::::: :: : : : :..:
CCDS72 VFKSSQEEVRSYSDPPLKFMSVQRPGPYDRPGTARRYIGIVKQAGLERMRPGAYSTG---
190 200 210 220 230
140 150 160 170 180
pF1KE0 YGGFDDYGGYNN-YGYGNDGFD-DRMRDGRGMGGHGYGGAGDASSGFHGGHFVHMRGLPF
:::...:.: .. ::. .: : : :: : :: . . . :: :::::::.
CCDS72 YGGYEEYSGLSDGYGFTTDLFGRDLSYCLSGMYDHRYGDS-EFTVQSTTGHCVHMRGLPY
240 250 260 270 280 290
190 200 210 220 230 240
pF1KE0 RATENDIANFFSPLNPIRVHIDIGADGRATGEADVEFVTHEDAVAAMSKDKNNMQHRYIE
.:::::: ::::::::.::::.:: :::.::::::::.:::.::::::::. ::::::::
CCDS72 KATENDIYNFFSPLNPVRVHIEIGPDGRVTGEADVEFATHEEAVAAMSKDRANMQHRYIE
300 310 320 330 340 350
250 260 270 280 290 300
pF1KE0 LFLNSTPGGGSGMGGSG-MGGYGRDGMDNQGGYGSVGRMGMGNNYSGGYGTPDGLGGYGR
:::::: :...: .: : :.: .. :. :... ..... :..::. ...:::
CCDS72 LFLNSTTGASNGAYSSQVMQGMGVSA--AQATYSGLESQSVSGCYGAGYSGQNSMGGYD
360 370 380 390 400 410
310 320 330
pF1KE0 GGGGSGGYYGQGGMSGGGWRGMY
331 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 03:07:06 2016 done: Sat Nov 5 03:07:06 2016
Total Scan time: 2.660 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]