FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB3085, 211 aa
  1>>>pF1KB3085 211 - 211 aa - 211 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.8355+/-0.000731; mu= 17.5946+/- 0.044
 mean_var=73.4865+/-14.410, 0's: 0 Z-trim(110.4): 45 B-trim: 0 in 0/51
 Lambda= 0.149613
 statistics sampled from 11538 (11583) to 11538 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.722), E-opt: 0.2 (0.356), width: 16
 Scan time: 2.160

The best scores are:                                     opt  bits E(32554)
CCDS3295.1 CLDN1 gene_id:9076|Hs108|chr3         ( 211) 1419 314.7 2.7e-86
CCDS11096.1 CLDN7 gene_id:1366|Hs108|chr17       ( 211)  925 208.0 3.4e-54
CCDS44125.1 CLDN19 gene_id:149461|Hs108|chr1     ( 211)  887 199.8 1e-51
CCDS471.1 CLDN19 gene_id:149461|Hs108|chr1       ( 224)  881 198.6 2.6e-51
CCDS5559.1 CLDN3 gene_id:1365|Hs108|chr7         ( 220)  734 166.8 9.1e-42
CCDS5560.1 CLDN4 gene_id:1364|Hs108|chr7         ( 209)  729 165.7 1.9e-41
CCDS10487.1 CLDN9 gene_id:9080|Hs108|chr16       ( 217)  727 165.3 2.6e-41
CCDS10488.1 CLDN6 gene_id:9074|Hs108|chr16       ( 220)  685 156.3 1.4e-38
CCDS13763.2 CLDN5 gene_id:7122|Hs108|chr22       ( 303)  684 156.2 2e-38
CCDS13645.1 CLDN14 gene_id:23562|Hs108|chr21     ( 239)  649 148.5 3.2e-36
CCDS54081.1 CLDN7 gene_id:1366|Hs108|chr17       ( 145)  619 141.8 2e-34
CCDS53306.1 CLDN19 gene_id:149461|Hs108|chr1     ( 218)  608 139.6 1.4e-33
CCDS13586.1 CLDN17 gene_id:26285|Hs108|chr21     ( 224)  602 138.3 3.5e-33
CCDS13587.1 CLDN8 gene_id:9073|Hs108|chr21       ( 225)  588 135.3 2.8e-32
CCDS14524.1 CLDN2 gene_id:9075|Hs108|chrX        ( 230)  529 122.6 2e-28
CCDS5249.1 CLDN20 gene_id:49861|Hs108|chr6       ( 219)  493 114.8 4.1e-26
CCDS9476.1 CLDN10 gene_id:9071|Hs108|chr13       ( 228)  490 114.2 6.7e-26
CCDS5717.1 CLDN15 gene_id:24146|Hs108|chr7       ( 228)  479 111.8 3.5e-25
CCDS9475.1 CLDN10 gene_id:9071|Hs108|chr13       ( 226)  449 105.3 3.1e-23
CCDS54824.1 CLDN24 gene_id:100132463|Hs108|chr4  ( 220)  446 104.7 4.7e-23
CCDS43286.1 CLDN22 gene_id:53842|Hs108|chr4      ( 220)  424  99.9 1.3e-21
CCDS44736.1 CLDN25 gene_id:644672|Hs108|chr11    ( 229)  382  90.9 7e-19
CCDS3213.1 CLDN11 gene_id:5010|Hs108|chr3        ( 207)  379  90.2 1e-18
CCDS33862.1 CLDN18 gene_id:51208|Hs108|chr3      ( 261)  355  85.1 4.3e-17
CCDS3095.1 CLDN18 gene_id:51208|Hs108|chr3       ( 261)  355  85.1 4.3e-17
CCDS3296.1 CLDN16 gene_id:10686|Hs108|chr3       ( 305)  280  69.0 3.6e-12

>>CCDS3295.1 CLDN1 gene_id:9076|Hs108|chr3 (211 aa)
 initn: 1419 init1: 1419 opt: 1419 Z-score: 1662.7 bits: 314.7 E(32554): 2.7e-86
Smith-Waterman score: 1419; 100.0% identity (100.0% similar) in 211 aa overlap (1-211:1-211)

               10        20        30        40        50        60
pF1KB3 MANAGLQLLGFILAFLGWIGAIVSTALPQWRIYSYAGDNIVTAQAMYEGLWMSCVSQSTG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 MANAGLQLLGFILAFLGWIGAIVSTALPQWRIYSYAGDNIVTAQAMYEGLWMSCVSQSTG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB3 QIQCKVFDSLLNLSSTLQATRALMVVGILLGVIAIFVATVGMKCMKCLEDDEVQKMRMAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 QIQCKVFDSLLNLSSTLQATRALMVVGILLGVIAIFVATVGMKCMKCLEDDEVQKMRMAV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB3 IGGAIFLLAGLAILVATAWYGNRIVQEFYDPMTPVNARYEFGQALFTGWAAASLCLLGGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 IGGAIFLLAGLAILVATAWYGNRIVQEFYDPMTPVNARYEFGQALFTGWAAASLCLLGGA
              130       140       150       160       170       180

              190       200       210
pF1KB3 LLCCSCPRKTTSYPTPRPYPKPAPSSGKDYV
       :::::::::::::::::::::::::::::::
CCDS32 LLCCSCPRKTTSYPTPRPYPKPAPSSGKDYV
              190       200       210

>>CCDS11096.1 CLDN7 gene_id:1366|Hs108|chr17 (211 aa)
 initn: 910 init1: 910 opt: 925 Z-score: 1086.4 bits: 208.0 E(32554): 3.4e-54
Smith-Waterman score: 925; 60.6% identity (86.4% similar) in 213 aa overlap (1-211:1-211)

               10        20        30        40        50        60
pF1KB3 MANAGLQLLGFILAFLGWIGAIVSTALPQWRIYSYAGDNIVTAQAMYEGLWMSCVSQSTG
       :::.::::::: .:.:::.: .. ::.:::.. 
:::::::.::::::.::::.::.:::: CCDS11 MANSGLQLLGFSMALLGWVGLVACTAIPQWQMSSYAGDNIITAQAMYKGLWMDCVTQSTG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 QIQCKVFDSLLNLSSTLQATRALMVVGILLGVIAIFVATVGMKCMKCLEDDEVQKMRMAV ...::..::.: ::..::::::::::...:: .:.::::.:::: .: ::.:.: :.:. CCDS11 MMSCKMYDSVLALSAALQATRALMVVSLVLGFLAMFVATMGMKCTRCGGDDKVKKARIAM 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB3 IGGAIFLLAGLAILVATAWYGNRIVQEFYDPMTPVNARYEFGQALFTGWAAASLCLLGGA :: ::..:::: ::: .:::..:: .::.:. :.: .:::: :.: :::...: .:::: CCDS11 GGGIIFIVAGLAALVACSWYGHQIVTDFYNPLIPTNIKYEFGPAIFIGWAGSALVILGGA 130 140 150 160 170 180 190 200 210 pF1KB3 LLCCSCP--RKTTSYPTPRPYPKPAPSSGKDYV :: :::: .. ..: .:: ::: .:.:.:: CCDS11 LLSCSCPGNESKAGYRVPRSYPKS--NSSKEYV 190 200 210 >>CCDS44125.1 CLDN19 gene_id:149461|Hs108|chr1 (211 aa) initn: 876 init1: 876 opt: 887 Z-score: 1042.1 bits: 199.8 E(32554): 1e-51 Smith-Waterman score: 887; 57.1% identity (85.4% similar) in 212 aa overlap (1-211:1-211) 10 20 30 40 50 60 pF1KB3 MANAGLQLLGFILAFLGWIGAIVSTALPQWRIYSYAGDNIVTAQAMYEGLWMSCVSQSTG :::.::::::..::. ::.: :.:::::::. ::::: :.:: ..::::::::.::::: CCDS44 MANSGLQLLGYFLALGGWVGIIASTALPQWKQSSYAGDAIITAVGLYEGLWMSCASQSTG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 QIQCKVFDSLLNLSSTLQATRALMVVGILLGVIAIFVATVGMKCMKCLEDDEVQKMRMAV :.:::..:::: :.. .:..::::::..::: .:. ...::::: . ... . : :.:. CCDS44 QVQCKLYDSLLALDGHIQSARALMVVAVLLGFVAMVLSVVGMKCTRVGDSNPIAKGRVAI 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB3 IGGAIFLLAGLAILVATAWYGNRIVQEFYDPMTPVNARYEFGQALFTGWAAASLCLLGGA :::.:.:::: :.:..::.. ..:::..: :::::::::: :::.:::.:.: .:::. CCDS44 AGGALFILAGLCTLTAVSWYATLVTQEFFNPSTPVNARYEFGPALFVGWASAGLAVLGGS 130 140 150 160 170 180 190 200 210 pF1KB3 LLCCSCPRKTTSYPTPRPYPKPAPSSG-KDYV .:::.::. .:.:: .:.::.. 
..:: CCDS44 FLCCTCPEPERPNSSPQPY-RPGPSAAAREYV 190 200 210 >>CCDS471.1 CLDN19 gene_id:149461|Hs108|chr1 (224 aa) initn: 864 init1: 864 opt: 881 Z-score: 1034.7 bits: 198.6 E(32554): 2.6e-51 Smith-Waterman score: 881; 57.8% identity (85.4% similar) in 206 aa overlap (1-206:1-205) 10 20 30 40 50 60 pF1KB3 MANAGLQLLGFILAFLGWIGAIVSTALPQWRIYSYAGDNIVTAQAMYEGLWMSCVSQSTG :::.::::::..::. ::.: :.:::::::. ::::: :.:: ..::::::::.::::: CCDS47 MANSGLQLLGYFLALGGWVGIIASTALPQWKQSSYAGDAIITAVGLYEGLWMSCASQSTG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 QIQCKVFDSLLNLSSTLQATRALMVVGILLGVIAIFVATVGMKCMKCLEDDEVQKMRMAV :.:::..:::: :.. .:..::::::..::: .:. ...::::: . ... . : :.:. CCDS47 QVQCKLYDSLLALDGHIQSARALMVVAVLLGFVAMVLSVVGMKCTRVGDSNPIAKGRVAI 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB3 IGGAIFLLAGLAILVATAWYGNRIVQEFYDPMTPVNARYEFGQALFTGWAAASLCLLGGA :::.:.:::: :.:..::.. ..:::..: :::::::::: :::.:::.:.: .:::. CCDS47 AGGALFILAGLCTLTAVSWYATLVTQEFFNPSTPVNARYEFGPALFVGWASAGLAVLGGS 130 140 150 160 170 180 190 200 210 pF1KB3 LLCCSCPRKTTSYPTPRPYPKPAPSSGKDYV .:::.::. .:.:: .:.::. CCDS47 FLCCTCPEPERPNSSPQPY-RPGPSAAAREPVVKLPASAKGPLGV 190 200 210 220 >>CCDS5559.1 CLDN3 gene_id:1365|Hs108|chr7 (220 aa) initn: 745 init1: 467 opt: 734 Z-score: 863.3 bits: 166.8 E(32554): 9.1e-42 Smith-Waterman score: 735; 50.2% identity (77.2% similar) in 219 aa overlap (5-211:4-220) 10 20 30 40 50 60 pF1KB3 MANAGLQLLGFILAFLGWIGAIVSTALPQWRIYSYAGDNIVTAQAMYEGLWMSCVSQSTG ::.. : :: :::.:.:: :::.::. .. :.::.:.: ..:::::.:: :::: CCDS55 MSMGLEITGTALAVLGWLGTIVCCALPMWRVSAFIGSNIITSQNIWEGLWMNCVVQSTG 10 20 30 40 50 70 80 90 100 110 120 pF1KB3 QIQCKVFDSLLNLSSTLQATRALMVVGILLGVIAIFVATVGMKCMKCLEDDEVQKMRMAV :.::::.:::: : . :::.:::.::.:::......:: :: .: .:..:: . : .... CCDS55 QMQCKVYDSLLALPQDLQAARALIVVAILLAAFGLLVALVGAQCTNCVQDDTA-KAKITI 60 70 80 90 100 110 130 140 150 160 170 180 pF1KB3 IGGAIFLLAGLAILVATAWYGNRIVQEFYDPMTPVNARYEFGQALFTGWAAASLCLLGGA ..:..::::.: :: ..: .: :...::.:..: . 
:.: .:..:::::.: ::::: CCDS55 VAGVLFLLAALLTLVPVSWSANTIIRDFYNPVVPEAQKREMGAGLYVGWAAAALQLLGGA 120 130 140 150 160 170 190 200 210 pF1KB3 LLCCSCP---RKTTS----YPTPRPYPKPAPSSG-----KDYV ::::::: .: :. : .:: :. : : :::: CCDS55 LLCCSCPPREKKYTATKVVYSAPRS-TGPGASLGTGYDRKDYV 180 190 200 210 220 >>CCDS5560.1 CLDN4 gene_id:1364|Hs108|chr7 (209 aa) initn: 822 init1: 488 opt: 729 Z-score: 857.8 bits: 165.7 E(32554): 1.9e-41 Smith-Waterman score: 729; 46.0% identity (80.6% similar) in 211 aa overlap (1-211:1-209) 10 20 30 40 50 60 pF1KB3 MANAGLQLLGFILAFLGWIGAIVSTALPQWRIYSYAGDNIVTAQAMYEGLWMSCVSQSTG ::. :::..:. :: :::..... :::.::. .. :.::::.:...:::::.:: :::: CCDS55 MASMGLQVMGIALAVLGWLAVMLCCALPMWRVTAFIGSNIVTSQTIWEGLWMNCVVQSTG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 QIQCKVFDSLLNLSSTLQATRALMVVGILLGVIAIFVATVGMKCMKCLEDDEVQKMRMAV :.::::.:::: : . :::.:::....:...........:: :: .:::: : : . . CCDS55 QMQCKVYDSLLALPQDLQAARALVIISIIVAALGVLLSVVGGKCTNCLED-ESAKAKTMI 70 80 90 100 110 130 140 150 160 170 180 pF1KB3 IGGAIFLLAGLAILVATAWYGNRIVQEFYDPMTPVNARYEFGQALFTGWAAASLCLLGGA ..:..:::::: ..: ..: .. :.:.::.:.. . . :.: .:..::::..: ::::. CCDS55 VAGVVFLLAGLMVIVPVSWTAHNIIQDFYNPLVASGQKREMGASLYVGWAASGLLLLGGG 120 130 140 150 160 170 190 200 210 pF1KB3 LLCCSCPRKTTSYPTPRPYPKPAPSSGKDYV ::::.:: .: . : : .....:: CCDS55 LLCCNCPPRTDK-PYSAKYSAARSAAASNYV 180 190 200 >>CCDS10487.1 CLDN9 gene_id:9080|Hs108|chr16 (217 aa) initn: 475 init1: 475 opt: 727 Z-score: 855.3 bits: 165.3 E(32554): 2.6e-41 Smith-Waterman score: 727; 49.1% identity (77.1% similar) in 218 aa overlap (1-211:1-217) 10 20 30 40 50 60 pF1KB3 MANAGLQLLGFILAFLGWIGAIVSTALPQWRIYSYAGDNIVTAQAMYEGLWMSCVSQSTG ::..::.:::. :: :::.:..:: ::: :.. .. :..::.::...:::::::: :::: CCDS10 MASTGLELLGMTLAVLGWLGTLVSCALPLWKVTAFIGNSIVVAQVVWEGLWMSCVVQSTG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 QIQCKVFDSLLNLSSTLQATRALMVVGILLGVIAIFVATVGMKCMKCLEDDEVQKMRMAV :.::::.:::: : . :::.::: :...::......:: .: .: :.:: : : :... 
CCDS10 QMQCKVYDSLLALPQDLQAARALCVIALLLALLGLLVAITGAQCTTCVED-EGAKARIVL 70 80 90 100 110 130 140 150 160 170 180 pF1KB3 IGGAIFLLAGLAILVATAWYGNRIVQEFYDPMTPVNARYEFGQALFTGWAAASLCLLGGA .:.:.::::. .:. . : .. :.:.::.:.. . :.: .:. :::::.: .:::. CCDS10 TAGVILLLAGILVLIPVCWTAHAIIQDFYNPLVAEALKRELGASLYLGWAAAALLMLGGG 120 130 140 150 160 170 190 200 210 pF1KB3 LLCCSCPRKTTSYPT-PR-PYPKPAPS--SG---KDYV ::::.:: . : :: : :. : :: .::: CCDS10 LLCCTCPPPQVERPRGPRLGYSIPSRSGASGLDKRDYV 180 190 200 210 >>CCDS10488.1 CLDN6 gene_id:9074|Hs108|chr16 (220 aa) initn: 462 init1: 441 opt: 685 Z-score: 806.2 bits: 156.3 E(32554): 1.4e-38 Smith-Waterman score: 685; 44.0% identity (78.3% similar) in 207 aa overlap (1-205:1-206) 10 20 30 40 50 60 pF1KB3 MANAGLQLLGFILAFLGWIGAIVSTALPQWRIYSYAGDNIVTAQAMYEGLWMSCVSQSTG ::.::.:.:: .:..:::....:: :::.:.. .. :..::.::...:::::::: :::: CCDS10 MASAGMQILGVVLTLLGWVNGLVSCALPMWKVTAFIGNSIVVAQVVWEGLWMSCVVQSTG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 QIQCKVFDSLLNLSSTLQATRALMVVGILLGVIAIFVATVGMKCMKCLEDDEVQKMRMAV :.::::.:::: : . :::.::: :...:.......: .: :: :.:. . .: :... CCDS10 QMQCKVYDSLLALPQDLQAARALCVIALLVALFGLLVYLAGAKCTTCVEEKD-SKARLVL 70 80 90 100 110 130 140 150 160 170 180 pF1KB3 IGGAIFLLAGLAILVATAWYGNRIVQEFYDPMTPVNARYEFGQALFTGWAAASLCLLGGA .: .:...:. :. . : .. :...::.:.. . :.: .:. ::::..: ::::. CCDS10 TSGIVFVISGVLTLIPVCWTAHAIIRDFYNPLVAEAQKRELGASLYLGWAASGLLLLGGG 120 130 140 150 160 170 190 200 210 pF1KB3 LLCCSCPRKTTSYPTPRP--YPKPAPSSGKDYV ::::.:: .. :. : ::. CCDS10 LLCCTCPSGGSQGPSHYMARYSTSAPAISRGPSEYPTKNYV 180 190 200 210 220 >>CCDS13763.2 CLDN5 gene_id:7122|Hs108|chr22 (303 aa) initn: 662 init1: 397 opt: 684 Z-score: 803.2 bits: 156.2 E(32554): 2e-38 Smith-Waterman score: 684; 45.3% identity (74.3% similar) in 214 aa overlap (1-210:86-297) 10 20 30 pF1KB3 MANAGLQLLGFILAFLGWIGAIVSTALPQW :..:.:..::..: ..:: : :.. 
.::.: CCDS13 GAKAPGPAQGAAQHGLGGSAGLRVRVSPLAMGSAALEILGLVLCLVGWGGLILACGLPMW 60 70 80 90 100 110 40 50 60 70 80 90 pF1KB3 RIYSYAGDNIVTAQAMYEGLWMSCVSQSTGQIQCKVFDSLLNLSSTLQATRALMVVGILL .. .. ::::::. ..::::::: ::::..::::.::.: ::. .::.::: : ..:: CCDS13 QVTAFLDHNIVTAQTTWKGLWMSCVVQSTGHMQCKVYDSVLALSTEVQAARALTVSAVLL 120 130 140 150 160 170 100 110 120 130 140 150 pF1KB3 GVIAIFVATVGMKCMKCLEDDEVQKMRMAVIGGAIFLLAGLAILVATAWYGNRIVQEFYD . .:.::. .: .: :. . : :.:. ::...:. :: :: :..: .:.:::: CCDS13 AFVALFVTLAGAQCTTCVAPGPA-KARVALTGGVLYLFCGLLALVPLCWFANIVVREFYD 180 190 200 210 220 230 160 170 180 190 200 pF1KB3 PMTPVNARYEFGQALFTGWAAASLCLLGGALLCCS---CP-RKTTSYPTPRPYPKPAPSS : .::. .::.: ::. ::::..: ..:: ::::. : : :.:. :. :.. CCDS13 PSVPVSQKYELGAALYIGWAATALLMVGGCLLCCGAWVCTGRPDLSFPVKYSAPR-RPTA 240 250 260 270 280 290 210 pF1KB3 GKDYV :: CCDS13 TGDYDKKNYV 300 >>CCDS13645.1 CLDN14 gene_id:23562|Hs108|chr21 (239 aa) initn: 660 init1: 386 opt: 649 Z-score: 763.7 bits: 148.5 E(32554): 3.2e-36 Smith-Waterman score: 653; 44.8% identity (72.4% similar) in 221 aa overlap (1-209:1-219) 10 20 30 40 50 60 pF1KB3 MANAGLQLLGFILAFLGWIGAIVSTALPQWRIYSYAGDNIVTAQAMYEGLWMSCVSQSTG ::....:::::.:.::: .:....: ::.:: ...: ::.:: .. .:::: :: .::: CCDS13 MASTAVQLLGFLLSFLGMVGTLITTILPHWRRTAHVGTNILTAVSYLKGLWMECVWHSTG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 QIQCKVFDSLLNLSSTLQATRALMVVGILLGVIAIFVATVGMKCMKCLEDDEVQKMRMAV ::... ::: : . :::.:::::.. ::. :: :..:::: .: . . : .:. CCDS13 IYQCQIYRSLLALPQDLQAARALMVISCLLSGIACACAVIGMKCTRCAKGTPA-KTTFAI 70 80 90 100 110 130 140 150 160 170 180 pF1KB3 IGGAIFLLAGLAILVATAWYGNRIVQEFYDPMTPVNARYEFGQALFTGWAAASLCLLGGA .::..:.:::: .::..: : .::.::.:. : . ..:.::::. :. ..:: :.::. CCDS13 LGGTLFILAGLLCMVAVSWTTNDVVQNFYNPLLPSGMKFEIGQALYLGFISSSLSLIGGT 120 130 140 150 160 170 190 200 210 pF1KB3 LLCCSC------------PRKTTSYPTPRPYPKPAPSSGKDYV ::: :: :: ::. . : .: :.. 
:: CCDS13 LLCLSCQDEAPYRPYQAPPRATTTTANTAPAYQP-PAAYKDNRAPSVTSATHSGYRLNDY 180 190 200 210 220 230 CCDS13 V 211 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 04:50:02 2016 done: Sat Nov 5 04:50:02 2016 Total Scan time: 2.160 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]