FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7516, 246 aa
  1>>>pF1KB7516 246 - 246 aa - 246 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.2297+/-0.000376; mu= 6.5245+/- 0.023
 mean_var=153.1489+/-30.348, 0's: 0 Z-trim(117.6): 84 B-trim: 0 in 0/55
 Lambda= 0.103638
 statistics sampled from 29590 (29675) to 29590 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.719), E-opt: 0.2 (0.348), width: 16
 Scan time: 7.160

The best scores are:                                       opt bits E(85289)
NP_003192 (OMIM: 600438,617156) transcription fact ( 246) 1643 257.0 2.3e-68
NP_001257711 (OMIM: 600438,617156) transcription f ( 214)  992 159.6 4.1e-39
XP_011538422 (OMIM: 600438,617156) PREDICTED: tran ( 149)  987 158.7 5.2e-39
XP_011538423 (OMIM: 600438,617156) PREDICTED: tran ( 148)  986 158.6 5.8e-39
NP_001137447 (OMIM: 613696) upstream-binding facto ( 393)  210  42.9   0.001

>>NP_003192 (OMIM: 600438,617156) transcription factor A (246 aa)
 initn: 1643 init1: 1643 opt: 1643 Z-score: 1348.4 bits: 257.0 E(85289): 2.3e-68
Smith-Waterman score: 1643; 100.0% identity (100.0% similar) in 246 aa overlap (1-246:1-246)

               10        20        30        40        50        60
pF1KB7 MAFLRSMWGVLSALGRSGAELCTGCGSRLRSPFSFVYLPRWFSSVLASCPKKPVSSYLRF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 MAFLRSMWGVLSALGRSGAELCTGCGSRLRSPFSFVYLPRWFSSVLASCPKKPVSSYLRF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 SKEQLPIFKAQNPDAKTTELIRRIAQRWRELPDSKKKIYQDAYRAEWQVYKEEISRFKEQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 SKEQLPIFKAQNPDAKTTELIRRIAQRWRELPDSKKKIYQDAYRAEWQVYKEEISRFKEQ
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 LTPSQIMSLEKEIMDKHLKRKAMTKKKELTLLGKPKRPRSAYNVYVAERFQEAKGDSPQE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 LTPSQIMSLEKEIMDKHLKRKAMTKKKELTLLGKPKRPRSAYNVYVAERFQEAKGDSPQE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 KLKTVKENWKNLSDSEKELYIQHAKEDETRYHNEMKSWEEQMIEVGRKDLLRRTIKKQRK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 KLKTVKENWKNLSDSEKELYIQHAKEDETRYHNEMKSWEEQMIEVGRKDLLRRTIKKQRK
              190       200       210       220       230       240

pF1KB7 YGAEEC
       ::::::
NP_003 YGAEEC

>>NP_001257711 (OMIM: 600438,617156) transcription facto (214 aa)
 initn: 992 init1: 992 opt: 992 Z-score: 823.2 bits: 159.6 E(85289): 4.1e-39
Smith-Waterman score: 1358; 87.0% identity (87.0% similar) in 246 aa overlap (1-246:1-214)

               10        20        30        40        50        60
pF1KB7 MAFLRSMWGVLSALGRSGAELCTGCGSRLRSPFSFVYLPRWFSSVLASCPKKPVSSYLRF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MAFLRSMWGVLSALGRSGAELCTGCGSRLRSPFSFVYLPRWFSSVLASCPKKPVSSYLRF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 SKEQLPIFKAQNPDAKTTELIRRIAQRWRELPDSKKKIYQDAYRAEWQVYKEEISRFKEQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SKEQLPIFKAQNPDAKTTELIRRIAQRWRELPDSKKKIYQDAYRAEWQVYKEEISRFKEQ
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 LTPSQIMSLEKEIMDKHLKRKAMTKKKELTLLGKPKRPRSAYNVYVAERFQEAKGDSPQE
       ::::::::::::::::::::::::::::
NP_001 LTPSQIMSLEKEIMDKHLKRKAMTKKKE--------------------------------
              130       140

              190       200       210       220       230       240
pF1KB7 KLKTVKENWKNLSDSEKELYIQHAKEDETRYHNEMKSWEEQMIEVGRKDLLRRTIKKQRK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 KLKTVKENWKNLSDSEKELYIQHAKEDETRYHNEMKSWEEQMIEVGRKDLLRRTIKKQRK
      150       160       170       180       190       200

pF1KB7 YGAEEC
       ::::::
NP_001 YGAEEC
      210

>>XP_011538422 (OMIM: 600438,617156) PREDICTED: transcri (149 aa)
 initn: 1009 init1: 987 opt: 987 Z-score: 821.3 bits: 158.7 E(85289): 5.2e-39
Smith-Waterman score: 987; 99.3% identity (100.0% similar) in 148 aa overlap (1-148:1-148)

               10        20        30        40        50        60
pF1KB7 MAFLRSMWGVLSALGRSGAELCTGCGSRLRSPFSFVYLPRWFSSVLASCPKKPVSSYLRF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 MAFLRSMWGVLSALGRSGAELCTGCGSRLRSPFSFVYLPRWFSSVLASCPKKPVSSYLRF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 SKEQLPIFKAQNPDAKTTELIRRIAQRWRELPDSKKKIYQDAYRAEWQVYKEEISRFKEQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 SKEQLPIFKAQNPDAKTTELIRRIAQRWRELPDSKKKIYQDAYRAEWQVYKEEISRFKEQ
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 LTPSQIMSLEKEIMDKHLKRKAMTKKKELTLLGKPKRPRSAYNVYVAERFQEAKGDSPQE
       :::::::::::::::::::::::::::.
XP_011 LTPSQIMSLEKEIMDKHLKRKAMTKKKKS
              130       140

>>XP_011538423 (OMIM: 600438,617156) PREDICTED: transcri (148 aa)
 initn: 986 init1: 986 opt: 986 Z-score: 820.5 bits: 158.6 E(85289): 5.8e-39
Smith-Waterman score: 986; 100.0% identity (100.0% similar) in 147 aa overlap (1-147:1-147)

               10        20        30        40        50        60
pF1KB7 MAFLRSMWGVLSALGRSGAELCTGCGSRLRSPFSFVYLPRWFSSVLASCPKKPVSSYLRF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 MAFLRSMWGVLSALGRSGAELCTGCGSRLRSPFSFVYLPRWFSSVLASCPKKPVSSYLRF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 SKEQLPIFKAQNPDAKTTELIRRIAQRWRELPDSKKKIYQDAYRAEWQVYKEEISRFKEQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 SKEQLPIFKAQNPDAKTTELIRRIAQRWRELPDSKKKIYQDAYRAEWQVYKEEISRFKEQ
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 LTPSQIMSLEKEIMDKHLKRKAMTKKKELTLLGKPKRPRSAYNVYVAERFQEAKGDSPQE
       :::::::::::::::::::::::::::
XP_011 LTPSQIMSLEKEIMDKHLKRKAMTKKKS
              130       140

>>NP_001137447 (OMIM: 613696) upstream-binding factor 1- (393 aa)
 initn: 192 init1: 192 opt: 210 Z-score: 187.8 bits: 42.9 E(85289): 0.001
Smith-Waterman score: 256; 23.8% identity (61.9% similar) in 189 aa overlap (50-218:100-287)

      20        30        40        50        60        70
pF1KB7 ELCTGCGSRLRSPFSFVYLPRWFSSVLASCPKKPVSSYLRFSKEQLPIFKAQNPDAKTTE
                                     ::.:...: :: ::. : .. . :  .. :
NP_001 FGTLKELVLEAKKCVKKMNKSQKYRNGPDFPKRPLTAYNRFFKESWPQYSQMYPGMRSQE
      70        80        90       100       110       120

      80        90       100       110       120       130
pF1KB7 LIRRIAQRWRELPDSKKKIYQDAYRAEWQVYKEEISRFKEQLTPSQIMSLEKEIMDKHLK
       : . .....::::.. :. : . .: : : ..:...::.:.  :. ... .:  ..:. .
NP_001 LTKILSKKYRELPEQMKQKYIQDFRKEKQEFEEKLARFREE-HPDLVQKAKKSSVSKRTQ
     130       140       150       160       170        180

     140                        150        160       170
pF1KB7 RKAMTKK-----------------KELTLLGKPKRP-RSAYNVYVAERF--QEAKGDSPQ
        :.. :                  :.. . :.:..:  ..:. . . . : .
NP_001 NKVQKKFQKNIEEVRSLPKTDRFFKKVKFHGEPQKPPMNGYHKFHQDSWSSKEMQHLSVR
      190       200       210       220       230       240

     180       190       200       210       220       230
pF1KB7 EKLKTVKENWKNLSDSEKELYIQHAKEDETRYHNEMKSWEEQMIEVGRKDLLRRTIKKQR
       :..  . . :. . .:.:. . ..:.: . .:. ..  :
NP_001 ERMVEIGRRWQRIPQSQKDHFKSQAEELQKQYKVKLDLWLKTLSPENYAAYKESTYAKGK
      250       260       270       280       290       300

     240
pF1KB7 KYGAEEC

NP_001 NMAMTGGPDPRLKQADPQSSSAKGLQEGFGEGQGLQAAGTDSSQTIWVNCHVSMEPEENR
      310       320       330       340       350       360

246 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 08:22:31 2016 done: Fri Nov 4 08:22:32 2016
 Total Scan time: 7.160 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]