FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB5971, 518 aa 1>>>pF1KB5971 518 - 518 aa - 518 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.8652+/-0.00035; mu= 11.0970+/- 0.022 mean_var=115.4235+/-23.042, 0's: 0 Z-trim(117.4): 29 B-trim: 197 in 1/56 Lambda= 0.119379 statistics sampled from 29340 (29369) to 29340 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.712), E-opt: 0.2 (0.344), width: 16 Scan time: 10.680 The best scores are: opt bits E(85289) NP_055296 (OMIM: 300773) DNA-(apurinic or apyrimid ( 518) 3553 623.0 6.5e-178 NP_001258677 (OMIM: 300773) DNA-(apurinic or apyri ( 347) 2411 426.2 7.5e-119 NP_542379 (OMIM: 107748) DNA-(apurinic or apyrimid ( 318) 187 43.2 0.0014 NP_001632 (OMIM: 107748) DNA-(apurinic or apyrimid ( 318) 187 43.2 0.0014 NP_001231178 (OMIM: 107748) DNA-(apurinic or apyri ( 318) 187 43.2 0.0014 NP_542380 (OMIM: 107748) DNA-(apurinic or apyrimid ( 318) 187 43.2 0.0014 >>NP_055296 (OMIM: 300773) DNA-(apurinic or apyrimidinic (518 aa) initn: 3553 init1: 3553 opt: 3553 Z-score: 3315.2 bits: 623.0 E(85289): 6.5e-178 Smith-Waterman score: 3553; 100.0% identity (100.0% similar) in 518 aa overlap (1-518:1-518) 10 20 30 40 50 60 pF1KB5 MLRVVSWNINGIRRPLQGVANQEPSNCAAVAVGRILDELDADIVCLQETKVTRDALTEPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 MLRVVSWNINGIRRPLQGVANQEPSNCAAVAVGRILDELDADIVCLQETKVTRDALTEPL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB5 AIVEGYNSYFSFSRNRSGYSGVATFCKDNATPVAAEEGLSGLFATQNGDVGCYGNMDEFT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 AIVEGYNSYFSFSRNRSGYSGVATFCKDNATPVAAEEGLSGLFATQNGDVGCYGNMDEFT 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB5 QEELRALDSEGRALLTQHKIRTWEGKEKTLTLINVYCPHADPGRPERLVFKMRFYRLLQI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 QEELRALDSEGRALLTQHKIRTWEGKEKTLTLINVYCPHADPGRPERLVFKMRFYRLLQI 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB5 RAEALLAAGSHVIILGDLNTAHRPIDHWDAVNLECFEEDPGRKWMDSLLSNLGCQSASHV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 RAEALLAAGSHVIILGDLNTAHRPIDHWDAVNLECFEEDPGRKWMDSLLSNLGCQSASHV 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB5 GPFIDSYRCFQPKQEGAFTCWSAVTGARHLNYGSRLDYVLGDRTLVIDTFQASFLLPEVM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 GPFIDSYRCFQPKQEGAFTCWSAVTGARHLNYGSRLDYVLGDRTLVIDTFQASFLLPEVM 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB5 GSDHCPVGAVLSVSSVPAKQCPPLCTRFLPEFAGTQLKILRFLVPLEQSPVLEQSTLQHN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 GSDHCPVGAVLSVSSVPAKQCPPLCTRFLPEFAGTQLKILRFLVPLEQSPVLEQSTLQHN 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB5 NQTRVQTCQNKAQVRSTRPQPSQVGSSRGQKNLKSYFQPSPSCPQASPDIELPSLPLMSA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 NQTRVQTCQNKAQVRSTRPQPSQVGSSRGQKNLKSYFQPSPSCPQASPDIELPSLPLMSA 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB5 LMTPKTPEEKAVAKVVKGQAKTSEAKDEKELRTSFWKSVLAGPLRTPLCGGHREPCVMRT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 LMTPKTPEEKAVAKVVKGQAKTSEAKDEKELRTSFWKSVLAGPLRTPLCGGHREPCVMRT 430 440 450 460 470 480 490 500 510 pF1KB5 VKKPGPNLGRRFYMCARPRGPPTDPSSRCNFFLWSRPS :::::::::::::::::::::::::::::::::::::: NP_055 VKKPGPNLGRRFYMCARPRGPPTDPSSRCNFFLWSRPS 490 500 510 >>NP_001258677 (OMIM: 300773) DNA-(apurinic or apyrimidi (347 aa) initn: 2411 init1: 2411 opt: 2411 Z-score: 2254.7 bits: 426.2 E(85289): 7.5e-119 Smith-Waterman score: 2411; 100.0% identity (100.0% similar) in 347 aa overlap (172-518:1-347) 150 160 170 180 190 200 pF1KB5 TWEGKEKTLTLINVYCPHADPGRPERLVFKMRFYRLLQIRAEALLAAGSHVIILGDLNTA :::::::::::::::::::::::::::::: NP_001 MRFYRLLQIRAEALLAAGSHVIILGDLNTA 10 20 30 210 220 230 240 250 260 pF1KB5 HRPIDHWDAVNLECFEEDPGRKWMDSLLSNLGCQSASHVGPFIDSYRCFQPKQEGAFTCW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 HRPIDHWDAVNLECFEEDPGRKWMDSLLSNLGCQSASHVGPFIDSYRCFQPKQEGAFTCW 40 50 60 70 80 90 270 280 290 300 310 320 pF1KB5 SAVTGARHLNYGSRLDYVLGDRTLVIDTFQASFLLPEVMGSDHCPVGAVLSVSSVPAKQC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SAVTGARHLNYGSRLDYVLGDRTLVIDTFQASFLLPEVMGSDHCPVGAVLSVSSVPAKQC 100 110 120 130 140 150 330 340 350 360 370 380 pF1KB5 PPLCTRFLPEFAGTQLKILRFLVPLEQSPVLEQSTLQHNNQTRVQTCQNKAQVRSTRPQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PPLCTRFLPEFAGTQLKILRFLVPLEQSPVLEQSTLQHNNQTRVQTCQNKAQVRSTRPQP 160 170 180 190 200 210 390 400 410 420 430 440 pF1KB5 SQVGSSRGQKNLKSYFQPSPSCPQASPDIELPSLPLMSALMTPKTPEEKAVAKVVKGQAK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SQVGSSRGQKNLKSYFQPSPSCPQASPDIELPSLPLMSALMTPKTPEEKAVAKVVKGQAK 220 230 240 250 260 270 450 460 470 480 490 500 pF1KB5 TSEAKDEKELRTSFWKSVLAGPLRTPLCGGHREPCVMRTVKKPGPNLGRRFYMCARPRGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 TSEAKDEKELRTSFWKSVLAGPLRTPLCGGHREPCVMRTVKKPGPNLGRRFYMCARPRGP 280 290 300 310 320 330 510 pF1KB5 PTDPSSRCNFFLWSRPS ::::::::::::::::: NP_001 PTDPSSRCNFFLWSRPS 340 >>NP_542379 (OMIM: 107748) DNA-(apurinic or apyrimidinic (318 aa) initn: 276 init1: 107 opt: 187 Z-score: 185.2 bits: 43.2 E(85289): 0.0014 Smith-Waterman score: 285; 26.9% identity (51.9% similar) in 320 aa overlap (2-313:62-318) 10 20 pF1KB5 MLRVVSWNINGIR-----RPLQGVANQEPSN :.. :::..:.: . :. : .. : NP_542 KNDKEAAGEGPALYEDPPDQKTSPSGKPATLKICSWNVDGLRAWIKKKGLDWVKEEAP-- 40 50 60 70 80 30 40 50 60 70 80 pF1KB5 CAAVAVGRILDELDADIVCLQETKVTRDALTEPLAIVEGYN-SYFSFSRNRSGYSGVATF ::.:::::: ... : : . : . .:.: .. ::::: NP_542 ---------------DILCLQETKCSENKLPAELQELPGLSHQYWSAPSDKEGYSGV--- 90 100 110 120 130 90 100 110 120 130 140 pF1KB5 CKDNATPVAAEEGLSGLFATQNGDVGCYGNMDEFTQEELRALDSEGRALLTQHKIRTWEG ::.. : :: :: :. :.:::..... NP_542 ---------------GLLSRQCPLKVSYGIGDE---EH----DQEGRVIVAEF------- 140 150 160 150 160 170 180 190 200 pF1KB5 KEKTLTLINVYCPHADPGRPERLVFKMRFYRLLQIRAEALLAAGSHVIILGDLNTAHRPI ...:...: :.: : :: ...:. . .. ..: :. . ... ::::.::. : NP_542 --DSFVLVTAYVPNAGRGLV-RLEYRQRWDEAFRKFLKGL-ASRKPLVLCGDLNVAHEEI 170 180 190 200 210 210 220 230 240 250 260 pF1KB5 DHWDAVNLECFEEDPGRKWMDSLLSN--LGCQSASHVGPFIDSYRCFQPKQEGAFTCWSA : ... : : .. . : .. :. ::.: . :. :.: :. NP_542 D---------LRNPKGNKKNAGFTPQERQGFGELLQAVPLADSFRHLYPNTPYAYTFWTY 220 230 240 250 260 270 280 290 300 310 320 pF1KB5 VTGARHLNYGSRLDYVLGDRTLVIDTFQASFLLPEVMGSDHCPVGAVLSVSSVPAKQCPP . .:: : : :::: : ...: . .. : . ...::::::. :.. NP_542 MMNARSKNVGWRLDYFLLSHSL-LPALCDSKIRSKALGSDHCPITLYLAL 270 280 290 300 310 330 340 350 360 370 380 pF1KB5 LCTRFLPEFAGTQLKILRFLVPLEQSPVLEQSTLQHNNQTRVQTCQNKAQVRSTRPQPSQ >>NP_001632 (OMIM: 107748) DNA-(apurinic or apyrimidinic (318 aa) initn: 276 init1: 107 opt: 187 Z-score: 185.2 bits: 43.2 E(85289): 0.0014 Smith-Waterman score: 285; 26.9% identity (51.9% similar) in 320 aa overlap (2-313:62-318) 10 20 pF1KB5 MLRVVSWNINGIR-----RPLQGVANQEPSN :.. :::..:.: . :. : .. : NP_001 KNDKEAAGEGPALYEDPPDQKTSPSGKPATLKICSWNVDGLRAWIKKKGLDWVKEEAP-- 40 50 60 70 80 30 40 50 60 70 80 pF1KB5 CAAVAVGRILDELDADIVCLQETKVTRDALTEPLAIVEGYN-SYFSFSRNRSGYSGVATF ::.:::::: ... : : . : . .:.: .. ::::: NP_001 ---------------DILCLQETKCSENKLPAELQELPGLSHQYWSAPSDKEGYSGV--- 90 100 110 120 130 90 100 110 120 130 140 pF1KB5 CKDNATPVAAEEGLSGLFATQNGDVGCYGNMDEFTQEELRALDSEGRALLTQHKIRTWEG ::.. : :: :: :. :.:::..... NP_001 ---------------GLLSRQCPLKVSYGIGDE---EH----DQEGRVIVAEF------- 140 150 160 150 160 170 180 190 200 pF1KB5 KEKTLTLINVYCPHADPGRPERLVFKMRFYRLLQIRAEALLAAGSHVIILGDLNTAHRPI ...:...: :.: : :: ...:. . .. ..: :. . ... ::::.::. : NP_001 --DSFVLVTAYVPNAGRGLV-RLEYRQRWDEAFRKFLKGL-ASRKPLVLCGDLNVAHEEI 170 180 190 200 210 210 220 230 240 250 260 pF1KB5 DHWDAVNLECFEEDPGRKWMDSLLSN--LGCQSASHVGPFIDSYRCFQPKQEGAFTCWSA : ... : : .. . : .. :. ::.: . :. :.: :. NP_001 D---------LRNPKGNKKNAGFTPQERQGFGELLQAVPLADSFRHLYPNTPYAYTFWTY 220 230 240 250 260 270 280 290 300 310 320 pF1KB5 VTGARHLNYGSRLDYVLGDRTLVIDTFQASFLLPEVMGSDHCPVGAVLSVSSVPAKQCPP . .:: : : :::: : ...: . .. : . ...::::::. :.. NP_001 MMNARSKNVGWRLDYFLLSHSL-LPALCDSKIRSKALGSDHCPITLYLAL 270 280 290 300 310 330 340 350 360 370 380 pF1KB5 LCTRFLPEFAGTQLKILRFLVPLEQSPVLEQSTLQHNNQTRVQTCQNKAQVRSTRPQPSQ >>NP_001231178 (OMIM: 107748) DNA-(apurinic or apyrimidi (318 aa) initn: 276 init1: 107 opt: 187 Z-score: 185.2 bits: 43.2 E(85289): 0.0014 Smith-Waterman score: 285; 26.9% identity (51.9% similar) in 320 aa overlap (2-313:62-318) 10 20 pF1KB5 MLRVVSWNINGIR-----RPLQGVANQEPSN :.. :::..:.: . :. : .. : NP_001 KNDKEAAGEGPALYEDPPDQKTSPSGKPATLKICSWNVDGLRAWIKKKGLDWVKEEAP-- 40 50 60 70 80 30 40 50 60 70 80 pF1KB5 CAAVAVGRILDELDADIVCLQETKVTRDALTEPLAIVEGYN-SYFSFSRNRSGYSGVATF ::.:::::: ... : : . : . .:.: .. ::::: NP_001 ---------------DILCLQETKCSENKLPAELQELPGLSHQYWSAPSDKEGYSGV--- 90 100 110 120 130 90 100 110 120 130 140 pF1KB5 CKDNATPVAAEEGLSGLFATQNGDVGCYGNMDEFTQEELRALDSEGRALLTQHKIRTWEG ::.. : :: :: :. :.:::..... NP_001 ---------------GLLSRQCPLKVSYGIGDE---EH----DQEGRVIVAEF------- 140 150 160 150 160 170 180 190 200 pF1KB5 KEKTLTLINVYCPHADPGRPERLVFKMRFYRLLQIRAEALLAAGSHVIILGDLNTAHRPI ...:...: :.: : :: ...:. . .. ..: :. . ... ::::.::. : NP_001 --DSFVLVTAYVPNAGRGLV-RLEYRQRWDEAFRKFLKGL-ASRKPLVLCGDLNVAHEEI 170 180 190 200 210 210 220 230 240 250 260 pF1KB5 DHWDAVNLECFEEDPGRKWMDSLLSN--LGCQSASHVGPFIDSYRCFQPKQEGAFTCWSA : ... : : .. . : .. :. ::.: . :. :.: :. NP_001 D---------LRNPKGNKKNAGFTPQERQGFGELLQAVPLADSFRHLYPNTPYAYTFWTY 220 230 240 250 260 270 280 290 300 310 320 pF1KB5 VTGARHLNYGSRLDYVLGDRTLVIDTFQASFLLPEVMGSDHCPVGAVLSVSSVPAKQCPP . .:: : : :::: : ...: . .. : . ...::::::. :.. NP_001 MMNARSKNVGWRLDYFLLSHSL-LPALCDSKIRSKALGSDHCPITLYLAL 270 280 290 300 310 330 340 350 360 370 380 pF1KB5 LCTRFLPEFAGTQLKILRFLVPLEQSPVLEQSTLQHNNQTRVQTCQNKAQVRSTRPQPSQ >>NP_542380 (OMIM: 107748) DNA-(apurinic or apyrimidinic (318 aa) initn: 276 init1: 107 opt: 187 Z-score: 185.2 bits: 43.2 E(85289): 0.0014 Smith-Waterman score: 285; 26.9% identity (51.9% similar) in 320 aa overlap (2-313:62-318) 10 20 pF1KB5 MLRVVSWNINGIR-----RPLQGVANQEPSN :.. :::..:.: . :. : .. : NP_542 KNDKEAAGEGPALYEDPPDQKTSPSGKPATLKICSWNVDGLRAWIKKKGLDWVKEEAP-- 40 50 60 70 80 30 40 50 60 70 80 pF1KB5 CAAVAVGRILDELDADIVCLQETKVTRDALTEPLAIVEGYN-SYFSFSRNRSGYSGVATF ::.:::::: ... : : . : . .:.: .. ::::: NP_542 ---------------DILCLQETKCSENKLPAELQELPGLSHQYWSAPSDKEGYSGV--- 90 100 110 120 130 90 100 110 120 130 140 pF1KB5 CKDNATPVAAEEGLSGLFATQNGDVGCYGNMDEFTQEELRALDSEGRALLTQHKIRTWEG ::.. : :: :: :. :.:::..... NP_542 ---------------GLLSRQCPLKVSYGIGDE---EH----DQEGRVIVAEF------- 140 150 160 150 160 170 180 190 200 pF1KB5 KEKTLTLINVYCPHADPGRPERLVFKMRFYRLLQIRAEALLAAGSHVIILGDLNTAHRPI ...:...: :.: : :: ...:. . .. ..: :. . ... ::::.::. : NP_542 --DSFVLVTAYVPNAGRGLV-RLEYRQRWDEAFRKFLKGL-ASRKPLVLCGDLNVAHEEI 170 180 190 200 210 210 220 230 240 250 260 pF1KB5 DHWDAVNLECFEEDPGRKWMDSLLSN--LGCQSASHVGPFIDSYRCFQPKQEGAFTCWSA : ... : : .. . : .. :. ::.: . :. :.: :. NP_542 D---------LRNPKGNKKNAGFTPQERQGFGELLQAVPLADSFRHLYPNTPYAYTFWTY 220 230 240 250 260 270 280 290 300 310 320 pF1KB5 VTGARHLNYGSRLDYVLGDRTLVIDTFQASFLLPEVMGSDHCPVGAVLSVSSVPAKQCPP . .:: : : :::: : ...: . .. : . ...::::::. :.. NP_542 MMNARSKNVGWRLDYFLLSHSL-LPALCDSKIRSKALGSDHCPITLYLAL 270 280 290 300 310 330 340 350 360 370 380 pF1KB5 LCTRFLPEFAGTQLKILRFLVPLEQSPVLEQSTLQHNNQTRVQTCQNKAQVRSTRPQPSQ 518 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 21:41:24 2016 done: Fri Nov 4 21:41:25 2016 Total Scan time: 10.680 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]