FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0665, 240 aa
  1>>>pF1KE0665 240 - 240 aa - 240 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.4875+/-0.000388; mu= 11.3487+/- 0.024
 mean_var=193.3228+/-38.335, 0's: 0 Z-trim(119.0): 29 B-trim: 0 in 0/53
 Lambda= 0.092243
 statistics sampled from 32576 (32605) to 32576 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.726), E-opt: 0.2 (0.382), width: 16
 Scan time: 6.120

The best scores are:                                       opt  bits E(85289)
NP_006749 (OMIM: 191317) splicing factor U2AF 35 k ( 240) 1677 235.0 9.1e-62
NP_001020374 (OMIM: 191317) splicing factor U2AF 3 ( 240) 1638 229.8 3.3e-60
NP_001020375 (OMIM: 191317) splicing factor U2AF 3 ( 167) 1180 168.6   6e-42
XP_016883957 (OMIM: 191317) PREDICTED: splicing fa ( 207)  889 130.0 3.1e-30
XP_011528045 (OMIM: 191317) PREDICTED: splicing fa ( 207)  889 130.0 3.1e-30
NP_001035515 (OMIM: 601080) splicing factor U2AF 2 ( 181)  680 102.1 6.7e-22
XP_005274654 (OMIM: 300028) PREDICTED: U2 small nu ( 348)  444  71.1 2.8e-12
XP_011543892 (OMIM: 300028) PREDICTED: U2 small nu ( 348)  444  71.1 2.8e-12
XP_016885372 (OMIM: 300028) PREDICTED: U2 small nu ( 348)  444  71.1 2.8e-12
XP_016885371 (OMIM: 300028) PREDICTED: U2 small nu ( 348)  444  71.1 2.8e-12
XP_011543891 (OMIM: 300028) PREDICTED: U2 small nu ( 460)  444  71.3 3.3e-12
NP_005080 (OMIM: 300028) U2 small nuclear ribonucl ( 482)  444  71.3 3.4e-12
XP_016885370 (OMIM: 300028) PREDICTED: U2 small nu ( 486)  444  71.3 3.5e-12
NP_659424 (OMIM: 601080) splicing factor U2AF 26 k ( 202)  369  60.8 2.1e-09

>>NP_006749 (OMIM: 191317) splicing factor U2AF 35 kDa s (240 aa)
 initn: 1677 init1: 1677 opt: 1677 Z-score: 1229.9 bits: 235.0 E(85289): 9.1e-62
Smith-Waterman score: 1677; 100.0% identity (100.0% similar) in 240 aa overlap (1-240:1-240)

10 20 30
40 50 60 pF1KE0 MAEYLASIFGTEKDKVNCSFYFKIGACRHGDRCSRLHNKPTFSQTIALLNIYRNPQNSSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 MAEYLASIFGTEKDKVNCSFYFKIGACRHGDRCSRLHNKPTFSQTIALLNIYRNPQNSSQ 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 SADGLRCAVSDVEMQEHYDEFFEEVFTEMEEKYGEVEEMNVCDNLGDHLVGNVYVKFRRE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 SADGLRCAVSDVEMQEHYDEFFEEVFTEMEEKYGEVEEMNVCDNLGDHLVGNVYVKFRRE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 EDAEKAVIDLNNRWFNGQPIHAELSPVTDFREACCRQYEMGECTRGGFCNFMHLKPISRE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 EDAEKAVIDLNNRWFNGQPIHAELSPVTDFREACCRQYEMGECTRGGFCNFMHLKPISRE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 LRRELYGRRRKKHRSRSRSRERRSRSRDRGRGGGGGGGGGGGGRERDRRRSRDRERSGRF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 LRRELYGRRRKKHRSRSRSRERRSRSRDRGRGGGGGGGGGGGGRERDRRRSRDRERSGRF 190 200 210 220 230 240 >>NP_001020374 (OMIM: 191317) splicing factor U2AF 35 kD (240 aa) initn: 1638 init1: 1638 opt: 1638 Z-score: 1201.8 bits: 229.8 E(85289): 3.3e-60 Smith-Waterman score: 1638; 97.1% identity (98.8% similar) in 240 aa overlap (1-240:1-240) 10 20 30 40 50 60 pF1KE0 MAEYLASIFGTEKDKVNCSFYFKIGACRHGDRCSRLHNKPTFSQTIALLNIYRNPQNSSQ :::::::::::::::::::::::::::::::::::::::::::::: . 
:::::::::.: NP_001 MAEYLASIFGTEKDKVNCSFYFKIGACRHGDRCSRLHNKPTFSQTILIQNIYRNPQNSAQ 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 SADGLRCAVSDVEMQEHYDEFFEEVFTEMEEKYGEVEEMNVCDNLGDHLVGNVYVKFRRE .::: .:::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 TADGSHCAVSDVEMQEHYDEFFEEVFTEMEEKYGEVEEMNVCDNLGDHLVGNVYVKFRRE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 EDAEKAVIDLNNRWFNGQPIHAELSPVTDFREACCRQYEMGECTRGGFCNFMHLKPISRE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 EDAEKAVIDLNNRWFNGQPIHAELSPVTDFREACCRQYEMGECTRGGFCNFMHLKPISRE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 LRRELYGRRRKKHRSRSRSRERRSRSRDRGRGGGGGGGGGGGGRERDRRRSRDRERSGRF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LRRELYGRRRKKHRSRSRSRERRSRSRDRGRGGGGGGGGGGGGRERDRRRSRDRERSGRF 190 200 210 220 230 240 >>NP_001020375 (OMIM: 191317) splicing factor U2AF 35 kD (167 aa) initn: 1180 init1: 1180 opt: 1180 Z-score: 874.1 bits: 168.6 E(85289): 6e-42 Smith-Waterman score: 1180; 100.0% identity (100.0% similar) in 167 aa overlap (74-240:1-167) 50 60 70 80 90 100 pF1KE0 QTIALLNIYRNPQNSSQSADGLRCAVSDVEMQEHYDEFFEEVFTEMEEKYGEVEEMNVCD :::::::::::::::::::::::::::::: NP_001 MQEHYDEFFEEVFTEMEEKYGEVEEMNVCD 10 20 30 110 120 130 140 150 160 pF1KE0 NLGDHLVGNVYVKFRREEDAEKAVIDLNNRWFNGQPIHAELSPVTDFREACCRQYEMGEC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 NLGDHLVGNVYVKFRREEDAEKAVIDLNNRWFNGQPIHAELSPVTDFREACCRQYEMGEC 40 50 60 70 80 90 170 180 190 200 210 220 pF1KE0 TRGGFCNFMHLKPISRELRRELYGRRRKKHRSRSRSRERRSRSRDRGRGGGGGGGGGGGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 TRGGFCNFMHLKPISRELRRELYGRRRKKHRSRSRSRERRSRSRDRGRGGGGGGGGGGGG 100 110 120 130 140 150 230 240 pF1KE0 RERDRRRSRDRERSGRF ::::::::::::::::: NP_001 RERDRRRSRDRERSGRF 160 >>XP_016883957 (OMIM: 191317) PREDICTED: splicing factor (207 aa) initn: 889 init1: 889 opt: 889 Z-score: 663.8 bits: 130.0 E(85289): 3.1e-30 Smith-Waterman score: 1341; 83.3% identity (85.0% similar) in 
240 aa overlap (1-240:1-207) 10 20 30 40 50 60 pF1KE0 MAEYLASIFGTEKDKVNCSFYFKIGACRHGDRCSRLHNKPTFSQTIALLNIYRNPQNSSQ :::::::::::::::::::::::::::::::::::::::::::::: . :::::::::.: XP_016 MAEYLASIFGTEKDKVNCSFYFKIGACRHGDRCSRLHNKPTFSQTILIQNIYRNPQNSAQ 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 SADGLRCAVSDVEMQEHYDEFFEEVFTEMEEKYGEVEEMNVCDNLGDHLVGNVYVKFRRE .::: .::::::::::::::::: :::: XP_016 TADGSHCAVSDVEMQEHYDEFFE---------------------------------FRRE 70 80 130 140 150 160 170 180 pF1KE0 EDAEKAVIDLNNRWFNGQPIHAELSPVTDFREACCRQYEMGECTRGGFCNFMHLKPISRE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 EDAEKAVIDLNNRWFNGQPIHAELSPVTDFREACCRQYEMGECTRGGFCNFMHLKPISRE 90 100 110 120 130 140 190 200 210 220 230 240 pF1KE0 LRRELYGRRRKKHRSRSRSRERRSRSRDRGRGGGGGGGGGGGGRERDRRRSRDRERSGRF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 LRRELYGRRRKKHRSRSRSRERRSRSRDRGRGGGGGGGGGGGGRERDRRRSRDRERSGRF 150 160 170 180 190 200 >>XP_011528045 (OMIM: 191317) PREDICTED: splicing factor (207 aa) initn: 889 init1: 889 opt: 889 Z-score: 663.8 bits: 130.0 E(85289): 3.1e-30 Smith-Waterman score: 1380; 86.2% identity (86.2% similar) in 240 aa overlap (1-240:1-207) 10 20 30 40 50 60 pF1KE0 MAEYLASIFGTEKDKVNCSFYFKIGACRHGDRCSRLHNKPTFSQTIALLNIYRNPQNSSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 MAEYLASIFGTEKDKVNCSFYFKIGACRHGDRCSRLHNKPTFSQTIALLNIYRNPQNSSQ 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 SADGLRCAVSDVEMQEHYDEFFEEVFTEMEEKYGEVEEMNVCDNLGDHLVGNVYVKFRRE ::::::::::::::::::::::: :::: XP_011 SADGLRCAVSDVEMQEHYDEFFE---------------------------------FRRE 70 80 130 140 150 160 170 180 pF1KE0 EDAEKAVIDLNNRWFNGQPIHAELSPVTDFREACCRQYEMGECTRGGFCNFMHLKPISRE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 EDAEKAVIDLNNRWFNGQPIHAELSPVTDFREACCRQYEMGECTRGGFCNFMHLKPISRE 90 100 110 120 130 140 190 200 210 220 230 240 pF1KE0 LRRELYGRRRKKHRSRSRSRERRSRSRDRGRGGGGGGGGGGGGRERDRRRSRDRERSGRF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 
XP_011 LRRELYGRRRKKHRSRSRSRERRSRSRDRGRGGGGGGGGGGGGRERDRRRSRDRERSGRF 150 160 170 180 190 200 >>NP_001035515 (OMIM: 601080) splicing factor U2AF 26 kD (181 aa) initn: 688 init1: 666 opt: 680 Z-score: 514.1 bits: 102.1 E(85289): 6.7e-22 Smith-Waterman score: 892; 64.2% identity (74.8% similar) in 218 aa overlap (1-212:1-179) 10 20 30 40 50 60 pF1KE0 MAEYLASIFGTEKDKVNCSFYFKIGACRHGDRCSRLHNKPTFSQTIALLNIYRNPQNSSQ :::::::::::::::::::::::::.:::::::::::::::::: NP_001 MAEYLASIFGTEKDKVNCSFYFKIGVCRHGDRCSRLHNKPTFSQ---------------- 10 20 30 40 70 80 90 100 110 120 pF1KE0 SADGLRCAVSDVEMQEHYDEFFEEVFTEMEEKYGEVEEMNVCDNLGDHLVGNVYVKFRRE :::::..:::::.:::::::::::::::::::::::: NP_001 -----------------------EVFTELQEKYGEIEEMNVCDNLGDHLVGNVYVKFRRE 50 60 70 80 130 140 150 160 170 180 pF1KE0 EDAEKAVIDLNNRWFNGQPIHAELSPVTDFREACCRQYEMGECTRGGFCNFMHLKPISRE ::.:.:: .:.::::::: .:.::::::::::.:::::::::::::::::::::.:::.. NP_001 EDGERAVAELSNRWFNGQAVHGELSPVTDFRESCCRQYEMGECTRGGFCNFMHLRPISQN 90 100 110 120 130 140 190 200 210 220 230 pF1KE0 LRRELYGR--RRK---KHRSRSRSRERRSR-SRDRGRGGGGGGGGGGGGRERDRRRSRDR :.:.:::: ::. . .. . ::: : : :. .: NP_001 LQRQLYGRGPRRRSPPRFHTGHHPRERNHRCSPDHWHGRF 150 160 170 180 240 pF1KE0 ERSGRF >>XP_005274654 (OMIM: 300028) PREDICTED: U2 small nuclea (348 aa) initn: 475 init1: 230 opt: 444 Z-score: 341.4 bits: 71.1 E(85289): 2.8e-12 Smith-Waterman score: 461; 35.1% identity (60.3% similar) in 239 aa overlap (12-234:28-265) 10 20 30 40 pF1KE0 MAEYLASIFGTEKDKVNCSFYFKIGACRHGDRCSRLHNKPTFSQ :::..:: :: : :::: :::::: :: :: : XP_005 MLDQAENELENGTTWQNPEPPVDFRVMEKDRANCPFYSKTGACRFGDRCSRKHNFPTSSP 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE0 TIALLNIYRN---PQNSSQSAD-GLRCAVSDVEMQEHYDEFFEEVFTEMEEKYGEVEEMN :. . ... . : .. : :. : ... .:.:.:. :... :.: ... XP_005 TLLIKSMFTTFGMEQCRRDDYDPDASLEYSEEETYQQFLDFYEDVLPEFKN-VGKVIQFK 70 80 90 100 110 110 120 130 140 150 160 pF1KE0 VCDNLGDHLVGNVYVKFRREEDAEKAVIDLNNRWFNGQPIHAELSPVTDFREACCRQYEM : :: :: :::::... ::. . :. .:.::. :. .. :. ::: .. : : .:. 
XP_005 VSCNLEPHLRGNVYVQYQSEEECQAALSLFNGRWYAGRQLQCEFCPVTRWKMAICGLFEI 120 130 140 150 160 170 170 180 190 200 pF1KE0 GECTRGGFCNFMHL--KPISR--ELRRELYGRRRKKHRSRSRSRERRSR--------SRD .: :: :::.:. .: .. : :..: . : ... ::: : :: XP_005 QQCPRGKHCNFLHVFRNPNNEFWEANRDIYLSPDRTGSSFGKNSERRERMGHHDDYYSRL 180 190 200 210 220 230 210 220 230 240 pF1KE0 RGRGGGGGGGGGGGGRERDRRRSRDRERSGRF ::: . . . . : .:. :: : XP_005 RGRRNPSPDHSYKRNGESERKSSRHRGKKSHKRTSKSRERHNSRSRGRNRDRSRDRSRGR 240 250 260 270 280 290 >>XP_011543892 (OMIM: 300028) PREDICTED: U2 small nuclea (348 aa) initn: 475 init1: 230 opt: 444 Z-score: 341.4 bits: 71.1 E(85289): 2.8e-12 Smith-Waterman score: 461; 35.1% identity (60.3% similar) in 239 aa overlap (12-234:28-265) 10 20 30 40 pF1KE0 MAEYLASIFGTEKDKVNCSFYFKIGACRHGDRCSRLHNKPTFSQ :::..:: :: : :::: :::::: :: :: : XP_011 MLDQAENELENGTTWQNPEPPVDFRVMEKDRANCPFYSKTGACRFGDRCSRKHNFPTSSP 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE0 TIALLNIYRN---PQNSSQSAD-GLRCAVSDVEMQEHYDEFFEEVFTEMEEKYGEVEEMN :. . ... . : .. : :. : ... .:.:.:. :... :.: ... XP_011 TLLIKSMFTTFGMEQCRRDDYDPDASLEYSEEETYQQFLDFYEDVLPEFKN-VGKVIQFK 70 80 90 100 110 110 120 130 140 150 160 pF1KE0 VCDNLGDHLVGNVYVKFRREEDAEKAVIDLNNRWFNGQPIHAELSPVTDFREACCRQYEM : :: :: :::::... ::. . :. .:.::. :. .. :. ::: .. : : .:. XP_011 VSCNLEPHLRGNVYVQYQSEEECQAALSLFNGRWYAGRQLQCEFCPVTRWKMAICGLFEI 120 130 140 150 160 170 170 180 190 200 pF1KE0 GECTRGGFCNFMHL--KPISR--ELRRELYGRRRKKHRSRSRSRERRSR--------SRD .: :: :::.:. .: .. : :..: . : ... ::: : :: XP_011 QQCPRGKHCNFLHVFRNPNNEFWEANRDIYLSPDRTGSSFGKNSERRERMGHHDDYYSRL 180 190 200 210 220 230 210 220 230 240 pF1KE0 RGRGGGGGGGGGGGGRERDRRRSRDRERSGRF ::: . . . . : .:. 
:: : XP_011 RGRRNPSPDHSYKRNGESERKSSRHRGKKSHKRTSKSRERHNSRSRGRNRDRSRDRSRGR 240 250 260 270 280 290 >>XP_016885372 (OMIM: 300028) PREDICTED: U2 small nuclea (348 aa) initn: 475 init1: 230 opt: 444 Z-score: 341.4 bits: 71.1 E(85289): 2.8e-12 Smith-Waterman score: 461; 35.1% identity (60.3% similar) in 239 aa overlap (12-234:28-265) 10 20 30 40 pF1KE0 MAEYLASIFGTEKDKVNCSFYFKIGACRHGDRCSRLHNKPTFSQ :::..:: :: : :::: :::::: :: :: : XP_016 MLDQAENELENGTTWQNPEPPVDFRVMEKDRANCPFYSKTGACRFGDRCSRKHNFPTSSP 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE0 TIALLNIYRN---PQNSSQSAD-GLRCAVSDVEMQEHYDEFFEEVFTEMEEKYGEVEEMN :. . ... . : .. : :. : ... .:.:.:. :... :.: ... XP_016 TLLIKSMFTTFGMEQCRRDDYDPDASLEYSEEETYQQFLDFYEDVLPEFKN-VGKVIQFK 70 80 90 100 110 110 120 130 140 150 160 pF1KE0 VCDNLGDHLVGNVYVKFRREEDAEKAVIDLNNRWFNGQPIHAELSPVTDFREACCRQYEM : :: :: :::::... ::. . :. .:.::. :. .. :. ::: .. : : .:. XP_016 VSCNLEPHLRGNVYVQYQSEEECQAALSLFNGRWYAGRQLQCEFCPVTRWKMAICGLFEI 120 130 140 150 160 170 170 180 190 200 pF1KE0 GECTRGGFCNFMHL--KPISR--ELRRELYGRRRKKHRSRSRSRERRSR--------SRD .: :: :::.:. .: .. : :..: . : ... ::: : :: XP_016 QQCPRGKHCNFLHVFRNPNNEFWEANRDIYLSPDRTGSSFGKNSERRERMGHHDDYYSRL 180 190 200 210 220 230 210 220 230 240 pF1KE0 RGRGGGGGGGGGGGGRERDRRRSRDRERSGRF ::: . . . . : .:. :: : XP_016 RGRRNPSPDHSYKRNGESERKSSRHRGKKSHKRTSKSRERHNSRSRGRNRDRSRDRSRGR 240 250 260 270 280 290 >>XP_016885371 (OMIM: 300028) PREDICTED: U2 small nuclea (348 aa) initn: 475 init1: 230 opt: 444 Z-score: 341.4 bits: 71.1 E(85289): 2.8e-12 Smith-Waterman score: 461; 35.1% identity (60.3% similar) in 239 aa overlap (12-234:28-265) 10 20 30 40 pF1KE0 MAEYLASIFGTEKDKVNCSFYFKIGACRHGDRCSRLHNKPTFSQ :::..:: :: : :::: :::::: :: :: : XP_016 MLDQAENELENGTTWQNPEPPVDFRVMEKDRANCPFYSKTGACRFGDRCSRKHNFPTSSP 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE0 TIALLNIYRN---PQNSSQSAD-GLRCAVSDVEMQEHYDEFFEEVFTEMEEKYGEVEEMN :. . ... . : .. : :. : ... .:.:.:. :... :.: ... 
XP_016 TLLIKSMFTTFGMEQCRRDDYDPDASLEYSEEETYQQFLDFYEDVLPEFKN-VGKVIQFK 70 80 90 100 110 110 120 130 140 150 160 pF1KE0 VCDNLGDHLVGNVYVKFRREEDAEKAVIDLNNRWFNGQPIHAELSPVTDFREACCRQYEM : :: :: :::::... ::. . :. .:.::. :. .. :. ::: .. : : .:. XP_016 VSCNLEPHLRGNVYVQYQSEEECQAALSLFNGRWYAGRQLQCEFCPVTRWKMAICGLFEI 120 130 140 150 160 170 170 180 190 200 pF1KE0 GECTRGGFCNFMHL--KPISR--ELRRELYGRRRKKHRSRSRSRERRSR--------SRD .: :: :::.:. .: .. : :..: . : ... ::: : :: XP_016 QQCPRGKHCNFLHVFRNPNNEFWEANRDIYLSPDRTGSSFGKNSERRERMGHHDDYYSRL 180 190 200 210 220 230 210 220 230 240 pF1KE0 RGRGGGGGGGGGGGGRERDRRRSRDRERSGRF ::: . . . . : .:. :: : XP_016 RGRRNPSPDHSYKRNGESERKSSRHRGKKSHKRTSKSRERHNSRSRGRNRDRSRDRSRGR 240 250 260 270 280 290

240 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Wed Nov 2 18:19:17 2016 done: Wed Nov 2 18:19:18 2016
 Total Scan time: 6.120 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]