FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB4679, 201 aa
1>>>pF1KB4679 201 - 201 aa - 201 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.6978+/-0.000847; mu= 9.8451+/- 0.051
mean_var=201.0270+/-42.509, 0's: 0 Z-trim(114.0): 76 B-trim: 853 in 1/53
Lambda= 0.090458
statistics sampled from 14494 (14576) to 14494 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.783), E-opt: 0.2 (0.448), width: 16
Scan time: 2.190
The best scores are: opt bits E(32554)
CCDS58580.1 SRSF1 gene_id:6426|Hs108|chr17 ( 201) 1431 198.1 3e-51
CCDS11600.1 SRSF1 gene_id:6426|Hs108|chr17 ( 248) 1307 182.1 2.5e-46
CCDS9199.1 SRSF9 gene_id:8683|Hs108|chr12 ( 221) 865 124.3 5.5e-29
CCDS333.1 SRSF4 gene_id:6429|Hs108|chr1 ( 494) 464 72.4 5.1e-13
CCDS32109.1 SRSF5 gene_id:6430|Hs108|chr14 ( 272) 456 71.1 7.3e-13
CCDS13318.1 SRSF6 gene_id:6431|Hs108|chr20 ( 344) 449 70.3 1.6e-12
>>CCDS58580.1 SRSF1 gene_id:6426|Hs108|chr17 (201 aa)
initn: 1431 init1: 1431 opt: 1431 Z-score: 1033.6 bits: 198.1 E(32554): 3e-51
Smith-Waterman score: 1431; 100.0% identity (100.0% similar) in 201 aa overlap (1-201:1-201)
10 20 30 40 50 60
pF1KB4 MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB4 FEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGRGGGGGGGGGAPRGRYGPPSRRSE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 FEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGRGGGGGGGGGAPRGRYGPPSRRSE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB4 NRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGTGVVEFVRKEDMTYAVRKLDNTKF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 NRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGTGVVEFVRKEDMTYAVRKLDNTKF
130 140 150 160 170 180
190 200
pF1KB4 RSHEVGYTRILFFDQNWIQWS
:::::::::::::::::::::
CCDS58 RSHEVGYTRILFFDQNWIQWS
190 200
>>CCDS11600.1 SRSF1 gene_id:6426|Hs108|chr17 (248 aa)
initn: 1303 init1: 1303 opt: 1307 Z-score: 945.1 bits: 182.1 E(32554): 2.5e-46
Smith-Waterman score: 1307; 96.9% identity (98.4% similar) in 192 aa overlap (1-190:1-192)
10 20 30 40 50 60
pF1KB4 MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB4 FEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGRGGGGGGGGGAPRGRYGPPSRRSE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 FEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGRGGGGGGGGGAPRGRYGPPSRRSE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB4 NRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGTGVVEFVRKEDMTYAVRKLDNTKF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 NRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGTGVVEFVRKEDMTYAVRKLDNTKF
130 140 150 160 170 180
190 200
pF1KB4 RSHE--VGYTRILFFDQNWIQWS
:::: ..: :.
CCDS11 RSHEGETAYIRVKVDGPRSPSYGRSRSRSRSRSRSRSRSNSRSRSYSPRRSRGSPRYSPR
190 200 210 220 230 240
>>CCDS9199.1 SRSF9 gene_id:8683|Hs108|chr12 (221 aa)
initn: 507 init1: 462 opt: 865 Z-score: 633.9 bits: 124.3 E(32554): 5.5e-29
Smith-Waterman score: 875; 68.9% identity (82.4% similar) in 193 aa overlap (1-190:1-182)
10 20 30 40 50 60
pF1KB4 MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE
::: . :: :..: :::::::: :.: ::.::.::::: ::.:.::::.: :::::.
CCDS91 MSGWADERG--GEGDGRIYVGNLPTDVREKDLEDLFYKYGRIREIELKNRHGLVPFAFVR
10 20 30 40 50
70 80 90 100 110
pF1KB4 FEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGRGGGGGGGGGAPRG-RYGPPSRRS
::::::::::.:::.:::: ::::::::. :: :: ::: : :::.:::
CCDS91 FEDPRDAEDAIYGRNGYDYGQCRLRVEFPRTY---------GGRGGWPRGGRNGPPTRRS
60 70 80 90 100
120 130 140 150 160 170
pF1KB4 ENRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGTGVVEFVRKEDMTYAVRKLDNTK
. ::.:::::::::::::::::::::::::::: .::.:.::..::::: ::.::::.::
CCDS91 DFRVLVSGLPPSGSWQDLKDHMREAGDVCYADVQKDGVGMVEYLRKEDMEYALRKLDDTK
110 120 130 140 150 160
180 190 200
pF1KB4 FRSHE--VGYTRILFFDQNWIQWS
::::: ..: :.
CCDS91 FRSHEGETSYIRVYPERSTSYGYSRSRSGSRGRDSPYQSRGSPHYFSPFRPY
170 180 190 200 210 220
>>CCDS333.1 SRSF4 gene_id:6429|Hs108|chr1 (494 aa)
initn: 443 init1: 143 opt: 464 Z-score: 347.3 bits: 72.4 E(32554): 5.1e-13
Smith-Waterman score: 464; 45.4% identity (66.1% similar) in 174 aa overlap (17-185:3-170)
10 20 30 40 50 60
pF1KB4 MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE
:.:.: : . : .:.: : :: : ..:::: : :::
CCDS33 MPRVYIGRLSYQARERDVERFFKGYGKILEVDLKNGYG-----FVE
10 20 30 40
70 80 90 100 110
pF1KB4 FEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGRGGGGGGGGG---APRGRYGPPSR
:.: :::.:::: .: : : :. :: :. : : :.: .: : . : .::::.:
CCDS33 FDDLRDADDAVYELNGKDLCGERVIVEHARGPRRDGSYGSGRSGYGYRRSGRDKYGPPTR
50 60 70 80 90 100
120 130 140 150 160 170
pF1KB4 RSENRVVVSGLPPSGSWQDLKDHMREAGDVCYADVY--RDGTGVVEFVRKEDMTYAVRKL
.: :..: .: :::::::.::.::.: :::.. : . ::.::: :: :..::
CCDS33 -TEYRLIVENLSSRCSWQDLKDYMRQAGEVTYADAHKGRKNEGVIEFVSYSDMKRALEKL
110 120 130 140 150 160
180 190 200
pF1KB4 DNTKFRSHEVGYTRILFFDQNWIQWS
:.:. ....
CCDS33 DGTEVNGRKIRLVEDKPGSRRRRSYSRSRSHSRSRSRSRHSRKSRSRSGSSKSSHSKSRS
170 180 190 200 210 220
>>CCDS32109.1 SRSF5 gene_id:6430|Hs108|chr14 (272 aa)
initn: 474 init1: 175 opt: 456 Z-score: 344.5 bits: 71.1 E(32554): 7.3e-13
Smith-Waterman score: 456; 45.8% identity (67.8% similar) in 177 aa overlap (16-185:4-174)
10 20 30 40 50 60
pF1KB4 MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE
::...: : : : ::.: : :: ::::::: :: :.:::
CCDS32 MSGCRVFIGRLNPAAREKDVERFFKGYGRIRDIDLK--RG---FGFVE
10 20 30 40
70 80 90 100 110
pF1KB4 FEDPRDAEDAVYGRDGYDYDGYRLRVEFPRS-GRGTGRGGGGGG---GGGAPRG-RYGPP
:::::::.:::: :: . . :. .: :. .:: ::: : . .. ::. : . :
CCDS32 FEDPRDADDAVYELDGKELCSERVTIEHARARSRG-GRGRGRYSDRFSSRRPRNDRRNAP
50 60 70 80 90 100
120 130 140 150 160 170
pF1KB4 SRRSENRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRD--GTGVVEFVRKEDMTYAVR
:.:::..: .: ::::::: ::.::.: .::..: . :::::. :. :..
CCDS32 PVRTENRLIVENLSSRVSWQDLKDFMRQAGEVTFADAHRPKLNEGVVEFASYGDLKNAIE
110 120 130 140 150 160
180 190 200
pF1KB4 KLDNTKFRSHEVGYTRILFFDQNWIQWS
::.. .. ....
CCDS32 KLSGKEINGRKIKLIEGSKRHSRSRSRSRSRTRSSSRSRSRSRSRSRKSYSRSRSRSRSR
170 180 190 200 210 220
>>CCDS13318.1 SRSF6 gene_id:6431|Hs108|chr20 (344 aa)
initn: 522 init1: 159 opt: 449 Z-score: 338.4 bits: 70.3 E(32554): 1.6e-12
Smith-Waterman score: 449; 45.6% identity (65.0% similar) in 180 aa overlap (17-185:3-176)
10 20 30 40 50 60
pF1KB4 MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE
:.:.: : ..: :::. : :: . ..:::: : :::
CCDS13 MPRVYIGRLSYNVREKDIQRFFSGYGRLLEVDLKNGYG-----FVE
10 20 30 40
70 80 90 100 110
pF1KB4 FEDPRDAEDAVYGRDGYDYDGYRLRVEF---PRSGR-GTGRGGGGGGGGGAPR---GR--
::: :::.:::: .: . : :. :: :: : : . :. .:::: . : ::
CCDS13 FEDSRDADDAVYELNGKELCGERVIVEHARGPRRDRDGYSYGSRSGGGGYSSRRTSGRDK
50 60 70 80 90 100
120 130 140 150 160
pF1KB4 YGPPSRRSENRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGT--GVVEFVRKEDMT
:::: : .: :..: .: ::::::: ::.::.: :::.... : ::.:: ::
CCDS13 YGPPVR-TEYRLIVENLSSRCSWQDLKDFMRQAGEVTYADAHKERTNEGVIEFRSYSDMK
110 120 130 140 150 160
170 180 190 200
pF1KB4 YAVRKLDNTKFRSHEVGYTRILFFDQNWIQWS
:. :::.:.. ....
CCDS13 RALDKLDGTEINGRNIRLIEDKPRTSHRRSYSGSRSRSRSRRRSRSRSRRSSRSRSRSIS
170 180 190 200 210 220
201 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 15:19:51 2016 done: Thu Nov 3 15:19:51 2016
Total Scan time: 2.190 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]