FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2657, 197 aa 1>>>pF1KE2657 197 - 197 aa - 197 aa Library: /omim/omim.rfq.tfa 60892289 residues in 85410 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.7688+/-0.000286; mu= 16.5449+/- 0.018 mean_var=62.1549+/-12.607, 0's: 0 Z-trim(118.0): 11 B-trim: 0 in 0/53 Lambda= 0.162681 statistics sampled from 30508 (30519) to 30508 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.734), E-opt: 0.2 (0.357), width: 16 Scan time: 6.320 The best scores are: opt bits E(85410) NP_003009 (OMIM: 178500,178620,610913) pulmonary s ( 197) 1287 309.8 1.8e-84 NP_001165881 (OMIM: 178500,178620,610913) pulmonar ( 197) 1287 309.8 1.8e-84 NP_001165828 (OMIM: 178500,178620,610913) pulmonar ( 191) 1222 294.6 6.9e-80 NP_001304709 (OMIM: 178500,178620,610913) pulmonar ( 191) 1222 294.6 6.9e-80 NP_001304707 (OMIM: 178500,178620,610913) pulmonar ( 191) 1222 294.6 6.9e-80 NP_001304708 (OMIM: 178500,178620,610913) pulmonar ( 144) 845 206.0 2.4e-53 XP_011542915 (OMIM: 178500,178620,610913) PREDICTE ( 144) 845 206.0 2.4e-53 NP_001011705 (OMIM: 605147) leukocyte cell-derived ( 333) 148 42.7 0.0008 NP_008946 (OMIM: 605147) leukocyte cell-derived ch ( 334) 148 42.7 0.0008 XP_011533200 (OMIM: 605147) PREDICTED: leukocyte c ( 364) 148 42.7 0.00086 XP_011533199 (OMIM: 605147) PREDICTED: leukocyte c ( 365) 148 42.7 0.00086 >>NP_003009 (OMIM: 178500,178620,610913) pulmonary surfa (197 aa) initn: 1287 init1: 1287 opt: 1287 Z-score: 1637.5 bits: 309.8 E(85410): 1.8e-84 Smith-Waterman score: 1287; 99.0% identity (100.0% similar) in 197 aa overlap (1-197:1-197) 10 20 30 40 50 60 pF1KE2 MDVGSKEVLMESPPDYSAAPRGRFGIPCCPVHLKRLLIVVVVVVLIVVVIVGALLMGLHM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 MDVGSKEVLMESPPDYSAAPRGRFGIPCCPVHLKRLLIVVVVVVLIVVVIVGALLMGLHM 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 SQKHTEMVLEMSIGAPEAQQRLALSEHLVTTATFSIGSTGLVVYDYQQLLIAYKPAPGTC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 SQKHTEMVLEMSIGAPEAQQRLALSEHLVTTATFSIGSTGLVVYDYQQLLIAYKPAPGTC 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 CYIMKIAPESIPSLEALNRKVHNFQMECSLQAKPAVPTSKLGQAEGRDAGSAPSGGDPAF :::::::::::::::::.:::::::::::::::::::::::::::::::::::::::::: NP_003 CYIMKIAPESIPSLEALTRKVHNFQMECSLQAKPAVPTSKLGQAEGRDAGSAPSGGDPAF 130 140 150 160 170 180 190 pF1KE2 LGMAVNTLCGEVPLYYI :::::.::::::::::: NP_003 LGMAVSTLCGEVPLYYI 190 >>NP_001165881 (OMIM: 178500,178620,610913) pulmonary su (197 aa) initn: 1287 init1: 1287 opt: 1287 Z-score: 1637.5 bits: 309.8 E(85410): 1.8e-84 Smith-Waterman score: 1287; 99.0% identity (100.0% similar) in 197 aa overlap (1-197:1-197) 10 20 30 40 50 60 pF1KE2 MDVGSKEVLMESPPDYSAAPRGRFGIPCCPVHLKRLLIVVVVVVLIVVVIVGALLMGLHM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MDVGSKEVLMESPPDYSAAPRGRFGIPCCPVHLKRLLIVVVVVVLIVVVIVGALLMGLHM 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 SQKHTEMVLEMSIGAPEAQQRLALSEHLVTTATFSIGSTGLVVYDYQQLLIAYKPAPGTC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SQKHTEMVLEMSIGAPEAQQRLALSEHLVTTATFSIGSTGLVVYDYQQLLIAYKPAPGTC 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 CYIMKIAPESIPSLEALNRKVHNFQMECSLQAKPAVPTSKLGQAEGRDAGSAPSGGDPAF :::::::::::::::::.:::::::::::::::::::::::::::::::::::::::::: NP_001 CYIMKIAPESIPSLEALTRKVHNFQMECSLQAKPAVPTSKLGQAEGRDAGSAPSGGDPAF 130 140 150 160 170 180 190 pF1KE2 LGMAVNTLCGEVPLYYI :::::.::::::::::: NP_001 LGMAVSTLCGEVPLYYI 190 >>NP_001165828 (OMIM: 178500,178620,610913) pulmonary su (191 aa) initn: 939 init1: 939 opt: 1222 Z-score: 1555.3 bits: 294.6 E(85410): 6.9e-80 Smith-Waterman score: 1222; 95.9% identity (97.0% similar) in 197 aa overlap (1-197:1-191) 10 20 30 40 50 60 pF1KE2 MDVGSKEVLMESPPDYSAAPRGRFGIPCCPVHLKRLLIVVVVVVLIVVVIVGALLMGLHM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MDVGSKEVLMESPPDYSAAPRGRFGIPCCPVHLKRLLIVVVVVVLIVVVIVGALLMGLHM 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 SQKHTEMVLEMSIGAPEAQQRLALSEHLVTTATFSIGSTGLVVYDYQQLLIAYKPAPGTC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SQKHTEMVLEMSIGAPEAQQRLALSEHLVTTATFSIGSTGLVVYDYQQLLIAYKPAPGTC 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 CYIMKIAPESIPSLEALNRKVHNFQMECSLQAKPAVPTSKLGQAEGRDAGSAPSGGDPAF :::::::::::::::::.::::::: ::::::::::::::::::::::::::::: NP_001 CYIMKIAPESIPSLEALTRKVHNFQ------AKPAVPTSKLGQAEGRDAGSAPSGGDPAF 130 140 150 160 170 190 pF1KE2 LGMAVNTLCGEVPLYYI :::::.::::::::::: NP_001 LGMAVSTLCGEVPLYYI 180 190 >>NP_001304709 (OMIM: 178500,178620,610913) pulmonary su (191 aa) initn: 939 init1: 939 opt: 1222 Z-score: 1555.3 bits: 294.6 E(85410): 6.9e-80 Smith-Waterman score: 1222; 95.9% identity (97.0% similar) in 197 aa overlap (1-197:1-191) 10 20 30 40 50 60 pF1KE2 MDVGSKEVLMESPPDYSAAPRGRFGIPCCPVHLKRLLIVVVVVVLIVVVIVGALLMGLHM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MDVGSKEVLMESPPDYSAAPRGRFGIPCCPVHLKRLLIVVVVVVLIVVVIVGALLMGLHM 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 SQKHTEMVLEMSIGAPEAQQRLALSEHLVTTATFSIGSTGLVVYDYQQLLIAYKPAPGTC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SQKHTEMVLEMSIGAPEAQQRLALSEHLVTTATFSIGSTGLVVYDYQQLLIAYKPAPGTC 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 CYIMKIAPESIPSLEALNRKVHNFQMECSLQAKPAVPTSKLGQAEGRDAGSAPSGGDPAF :::::::::::::::::.::::::: ::::::::::::::::::::::::::::: NP_001 CYIMKIAPESIPSLEALTRKVHNFQ------AKPAVPTSKLGQAEGRDAGSAPSGGDPAF 130 140 150 160 170 190 pF1KE2 LGMAVNTLCGEVPLYYI :::::.::::::::::: NP_001 LGMAVSTLCGEVPLYYI 180 190 >>NP_001304707 (OMIM: 178500,178620,610913) pulmonary su (191 aa) initn: 939 init1: 939 opt: 1222 Z-score: 1555.3 bits: 294.6 E(85410): 6.9e-80 Smith-Waterman score: 1222; 95.9% identity (97.0% similar) in 197 aa overlap (1-197:1-191) 10 20 30 40 50 60 pF1KE2 MDVGSKEVLMESPPDYSAAPRGRFGIPCCPVHLKRLLIVVVVVVLIVVVIVGALLMGLHM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MDVGSKEVLMESPPDYSAAPRGRFGIPCCPVHLKRLLIVVVVVVLIVVVIVGALLMGLHM 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 SQKHTEMVLEMSIGAPEAQQRLALSEHLVTTATFSIGSTGLVVYDYQQLLIAYKPAPGTC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SQKHTEMVLEMSIGAPEAQQRLALSEHLVTTATFSIGSTGLVVYDYQQLLIAYKPAPGTC 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 CYIMKIAPESIPSLEALNRKVHNFQMECSLQAKPAVPTSKLGQAEGRDAGSAPSGGDPAF :::::::::::::::::.::::::: ::::::::::::::::::::::::::::: NP_001 CYIMKIAPESIPSLEALTRKVHNFQ------AKPAVPTSKLGQAEGRDAGSAPSGGDPAF 130 140 150 160 170 190 pF1KE2 LGMAVNTLCGEVPLYYI :::::.::::::::::: NP_001 LGMAVSTLCGEVPLYYI 180 190 >>NP_001304708 (OMIM: 178500,178620,610913) pulmonary su (144 aa) initn: 924 init1: 845 opt: 845 Z-score: 1078.8 bits: 206.0 E(85410): 2.4e-53 Smith-Waterman score: 845; 98.5% identity (100.0% similar) in 130 aa overlap (68-197:15-144) 40 50 60 70 80 90 pF1KE2 IVVVVVVLIVVVIVGALLMGLHMSQKHTEMVLEMSIGAPEAQQRLALSEHLVTTATFSIG :::::::::::::::::::::::::::::: NP_001 MDVGSKEVLMESPPVLEMSIGAPEAQQRLALSEHLVTTATFSIG 10 20 30 40 100 110 120 130 140 150 pF1KE2 STGLVVYDYQQLLIAYKPAPGTCCYIMKIAPESIPSLEALNRKVHNFQMECSLQAKPAVP ::::::::::::::::::::::::::::::::::::::::.::::::::::::::::::: NP_001 STGLVVYDYQQLLIAYKPAPGTCCYIMKIAPESIPSLEALTRKVHNFQMECSLQAKPAVP 50 60 70 80 90 100 160 170 180 190 pF1KE2 TSKLGQAEGRDAGSAPSGGDPAFLGMAVNTLCGEVPLYYI ::::::::::::::::::::::::::::.::::::::::: NP_001 TSKLGQAEGRDAGSAPSGGDPAFLGMAVSTLCGEVPLYYI 110 120 130 140 >>XP_011542915 (OMIM: 178500,178620,610913) PREDICTED: p (144 aa) initn: 924 init1: 845 opt: 845 Z-score: 1078.8 bits: 206.0 E(85410): 2.4e-53 Smith-Waterman score: 845; 98.5% identity (100.0% similar) in 130 aa overlap (68-197:15-144) 40 50 60 70 80 90 pF1KE2 IVVVVVVLIVVVIVGALLMGLHMSQKHTEMVLEMSIGAPEAQQRLALSEHLVTTATFSIG :::::::::::::::::::::::::::::: XP_011 MDVGSKEVLMESPPVLEMSIGAPEAQQRLALSEHLVTTATFSIG 10 20 30 40 100 110 120 130 140 150 pF1KE2 STGLVVYDYQQLLIAYKPAPGTCCYIMKIAPESIPSLEALNRKVHNFQMECSLQAKPAVP ::::::::::::::::::::::::::::::::::::::::.::::::::::::::::::: XP_011 STGLVVYDYQQLLIAYKPAPGTCCYIMKIAPESIPSLEALTRKVHNFQMECSLQAKPAVP 50 60 70 80 90 100 160 170 180 190 pF1KE2 TSKLGQAEGRDAGSAPSGGDPAFLGMAVNTLCGEVPLYYI ::::::::::::::::::::::::::::.::::::::::: XP_011 TSKLGQAEGRDAGSAPSGGDPAFLGMAVSTLCGEVPLYYI 110 120 130 140 >>NP_001011705 (OMIM: 605147) leukocyte cell-derived che (333 aa) initn: 115 init1: 59 opt: 148 Z-score: 189.6 bits: 42.7 E(85410): 0.0008 Smith-Waterman score: 148; 24.9% identity (55.2% similar) in 201 aa overlap (3-197:13-201) 10 20 30 40 pF1KE2 MDVGSKEVLMESPPDYSAAPRGRFGIPCCPVHLKRLLIVVVV--VVLIVV :: .: . ::: :.. : :..: .. ::.. .::.. NP_001 MTENSDKVPIALVGPDDVEFCSPPAYATLTVK----PSSPARLLKVGAVVLISGAVLLLF 10 20 30 40 50 50 60 70 80 90 100 pF1KE2 VIVGALLMGLHMSQKHTEMV-LEMSIGAPEAQQRLALSEHLVTTATFSIGS---TGLVVY .::. . . :..: : :::.. . . .. . ::..:: ...: NP_001 GAIGAFYF-WKGSDSHIYNVHYTMSINGKLQDGSMEIDAG-NNLETFKMGSGAEEAIAVN 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE2 DYQQLLIAYKPAPGTCCYIMKIAPESIPSLEALNRKVHNFQMECSLQAKPAVPTSKLGQA :.:. . . . : : ::: . :: . :.... .. .:..: .:.. .. NP_001 DFQNGITGIRFAGGEKCYIKAQVKARIPEVGAVTKQ----SISSKLEGK-IMPVKYEENS 120 130 140 150 160 170 180 190 pF1KE2 EGRDAGSAPSGGDPAFLGMAVNTLCGEVPLYYI : . : : .::. : :::..:.... NP_001 LIWVAVDQPVK-DNSFLSSKVLELCGDLPIFWLKPTYPKEIQRERREVVRKIVPTTTKRP 170 180 190 200 210 220 NP_001 HSGPRSNPGAGRLNNETRPSVQEDSQAFNPDNPYHQEGESMTFDPRLDHEGICCIECRRS 230 240 250 260 270 280 >>NP_008946 (OMIM: 605147) leukocyte cell-derived chemot (334 aa) initn: 115 init1: 59 opt: 148 Z-score: 189.6 bits: 42.7 E(85410): 0.0008 Smith-Waterman score: 148; 24.9% identity (55.2% similar) in 201 aa overlap (3-197:13-201) 10 20 30 40 pF1KE2 MDVGSKEVLMESPPDYSAAPRGRFGIPCCPVHLKRLLIVVVV--VVLIVV :: .: . ::: :.. : :..: .. ::.. .::.. NP_008 MTENSDKVPIALVGPDDVEFCSPPAYATLTVK----PSSPARLLKVGAVVLISGAVLLLF 10 20 30 40 50 50 60 70 80 90 100 pF1KE2 VIVGALLMGLHMSQKHTEMV-LEMSIGAPEAQQRLALSEHLVTTATFSIGS---TGLVVY .::. . . :..: : :::.. . . .. . ::..:: ...: NP_008 GAIGAFYF-WKGSDSHIYNVHYTMSINGKLQDGSMEIDAG-NNLETFKMGSGAEEAIAVN 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE2 DYQQLLIAYKPAPGTCCYIMKIAPESIPSLEALNRKVHNFQMECSLQAKPAVPTSKLGQA :.:. . . . : : ::: . :: . :.... .. .:..: .:.. .. NP_008 DFQNGITGIRFAGGEKCYIKAQVKARIPEVGAVTKQ----SISSKLEGK-IMPVKYEENS 120 130 140 150 160 170 180 190 pF1KE2 EGRDAGSAPSGGDPAFLGMAVNTLCGEVPLYYI : . : : .::. : :::..:.... NP_008 LIWVAVDQPVK-DNSFLSSKVLELCGDLPIFWLKPTYPKEIQRERREVVRKIVPTTTKRP 170 180 190 200 210 220 NP_008 HSGPRSNPGAGRLNNETRPSVQEDSQAFNPDNPYHQQEGESMTFDPRLDHEGICCIECRR 230 240 250 260 270 280 >>XP_011533200 (OMIM: 605147) PREDICTED: leukocyte cell- (364 aa) initn: 115 init1: 59 opt: 148 Z-score: 189.1 bits: 42.7 E(85410): 0.00086 Smith-Waterman score: 148; 24.9% identity (55.2% similar) in 201 aa overlap (3-197:40-228) 10 20 30 pF1KE2 MDVGSKEVLMESPPDYSAAPRGRFGIPCCPVH :: .: . ::: :.. : :.. XP_011 RSQGAQGVSKCLTPPAANMTENSDKVPIALVGPDDVEFCSPPAYATLTVK----PSSPAR 10 20 30 40 50 60 40 50 60 70 80 pF1KE2 LKRLLIVVVV--VVLIVVVIVGALLMGLHMSQKHTEMV-LEMSIGAPEAQQRLALSEHLV : .. ::.. .::.. .::. . . :..: : :::.. . . .. XP_011 LLKVGAVVLISGAVLLLFGAIGAFYF-WKGSDSHIYNVHYTMSINGKLQDGSMEIDAG-N 70 80 90 100 110 120 90 100 110 120 130 140 pF1KE2 TTATFSIGS---TGLVVYDYQQLLIAYKPAPGTCCYIMKIAPESIPSLEALNRKVHNFQM . ::..:: ...: :.:. . . . : : ::: . :: . :.... .. XP_011 NLETFKMGSGAEEAIAVNDFQNGITGIRFAGGEKCYIKAQVKARIPEVGAVTKQ----SI 130 140 150 160 170 150 160 170 180 190 pF1KE2 ECSLQAKPAVPTSKLGQAEGRDAGSAPSGGDPAFLGMAVNTLCGEVPLYYI .:..: .:.. .. : . : : .::. : :::..:.... XP_011 SSKLEGK-IMPVKYEENSLIWVAVDQPVK-DNSFLSSKVLELCGDLPIFWLKPTYPKEIF 180 190 200 210 220 230 XP_011 AEIQRERREVVRKIVPTTTKRPHSGPRSNPGAGRLNNETRPSVQEDSQAFNPDNPYHQEG 240 250 260 270 280 290 197 residues in 1 query sequences 60892289 residues in 85410 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Wed Dec 21 11:14:00 2016 done: Wed Dec 21 11:14:01 2016 Total Scan time: 6.320 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]