FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB5878, 495 aa 1>>>pF1KB5878 495 - 495 aa - 495 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 15.7845+/-0.000679; mu= -29.3790+/- 0.042 mean_var=819.3930+/-168.923, 0's: 0 Z-trim(117.3): 62 B-trim: 0 in 0/58 Lambda= 0.044805 statistics sampled from 29095 (29136) to 29095 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.637), E-opt: 0.2 (0.342), width: 16 Scan time: 10.960 The best scores are: opt bits E(85289) NP_060835 (OMIM: 610766) meiosis-specific nuclear ( 495) 3125 217.9 5.5e-56 NP_009044 (OMIM: 190370) trichohyalin [Homo sapien (1943) 420 43.6 0.0062 NP_036469 (OMIM: 605152) cilia- and flagella-assoc ( 551) 393 41.3 0.0086 >>NP_060835 (OMIM: 610766) meiosis-specific nuclear stru (495 aa) initn: 3125 init1: 3125 opt: 3125 Z-score: 1126.1 bits: 217.9 E(85289): 5.5e-56 Smith-Waterman score: 3125; 99.8% identity (99.8% similar) in 495 aa overlap (1-495:1-495) 10 20 30 40 50 60 pF1KB5 MGSKRRNLSCSERHQKLVDENYCKKLHVQALKNVNSQIRNQMVQNENDNRVQRKQFLRLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_060 MGSKRRNLSCSERHQKLVDENYCKKLHVQALKNVNSQIRNQMVQNENDNRVQRKQFLRLL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB5 QNEQFELDMEEAIQKAEENKRLKELQLKQEEKLAMELAKLKHESLKDEKMRQQVRENSIE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_060 QNEQFELDMEEAIQKAEENKRLKELQLKQEEKLAMELAKLKHESLKDEKMRQQVRENSIE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB5 LRELEKKLKAAYMNKERAAQIAEKDAIKYEQMKRDAEIAKTMMEEHKRIIKEENAAEDKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_060 LRELEKKLKAAYMNKERAAQIAEKDAIKYEQMKRDAEIAKTMMEEHKRIIKEENAAEDKR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB5 NKAKAQYYLDLEKQLEEQEKKKQEAYEQLLKEKLMIDEIVRKIYEEDQLEKQQKLEKMNA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_060 NKAKAQYYLDLEKQLEEQEKKKQEAYEQLLKEKLMIDEIVRKIYEEDQLEKQQKLEKMNA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB5 MRRYIEEFQKEQALWRKKKREEMEEENRKIIEFANMQQQREEDRMAKVQENEEKRLQLQN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_060 MRRYIEEFQKEQALWRKKKREEMEEENRKIIEFANMQQQREEDRMAKVQENEEKRLQLQN 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB5 ALTQKLEEMLRQREDLEQVRQELYQEEQAEIYKSKLKEEAEKKLRKQKEMKQDFEEQMAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_060 ALTQKLEEMLRQREDLEQVRQELYQEEQAEIYKSKLKEEAEKKLRKQKEMKQDFEEQMAL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB5 KELVLQAAKEEEENFRKTMLAKFAEDDRIELMNAQKQRMKQLEHRRAVEKLIEERRQQFL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_060 KELVLQAAKEEEENFRKTMLAKFAEDDRIELMNAQKQRMKQLEHRRAVEKLIEERRQQFL 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB5 ADKQRGLEEWQLQQRRQGFINAIIEEERLKLLKEHATNLLGYLPKGVFKKEDDIDLLGEE ::::: :::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_060 ADKQRELEEWQLQQRRQGFINAIIEEERLKLLKEHATNLLGYLPKGVFKKEDDIDLLGEE 430 440 450 460 470 480 490 pF1KB5 FRKVYQQRSEICEEK ::::::::::::::: NP_060 FRKVYQQRSEICEEK 490 >>NP_009044 (OMIM: 190370) trichohyalin [Homo sapiens] (1943 aa) initn: 350 init1: 141 opt: 420 Z-score: 173.6 bits: 43.6 E(85289): 0.0062 Smith-Waterman score: 450; 25.8% identity (62.9% similar) in 450 aa overlap (12-435:266-700) 10 20 30 40 pF1KB5 MGSKRRNLSCSERHQKLVDEN-YCKKLHVQALKNVNSQIRN .:...: .:. .::. : :. :. NP_009 RVFQEEEEKEWRKRETVLRKEEEKLQEEEPQRQRELQEEEEQLRKLERQELR------RE 240 250 260 270 280 50 60 70 80 90 pF1KB5 QMVQNENDNRVQRKQFLRLLQNEQFELDMEEAIQKAE-----ENKRLKELQLKQEEKLAM .. ......:..:.: :: :.:. . ..:: .. : :..: ..:. .:::. . NP_009 RQEEEQQQQRLRREQQLRRKQEEERREQQEERREQQERREQQEERREQQLRREQEERREQ 290 300 310 320 330 340 100 110 120 130 140 pF1KB5 ELAKLKHESLKDEKMR---------QQVRENSIELRELEKKLKAAYMNKE----RAAQIA .: . ..: .....: ::.:... .::. .. . . .: : :. NP_009 QLRREQEEERREQQLRREQEEERREQQLRREQ-QLRREQQLRREQQLRREQQLRREQQLR 350 360 370 380 390 400 150 160 170 180 190 200 pF1KB5 EKDAIKYEQMKRDAEIAKTMMEEHKRIIKEENAAEDKRNKAKAQYYLDLEKQLEEQEKKK ... .. ::. : . . .::... :.:. ...: : . . : :. :: :... NP_009 REQQLRREQQLRREQQLRREQEEERHEQKHEQERREQRLKREQEERRDWLKREEETERHE 410 420 430 440 450 460 210 220 230 240 250 260 pF1KB5 QEAYEQLLKEKLMIDEIVR--KIYEEDQLEKQQKLEKMNAMRRYIEEFQKEQALWRKKKR :: .: ::. .. : :. ::.. :.:.. :.. .:: :: ..:: : :.. NP_009 QERRKQQLKRDQEEERRERWLKLEEEERREQQERREQQ--LRREQEE-RREQ---RLKRQ 470 480 490 500 510 520 270 280 290 300 310 pF1KB5 EEMEEENRKIIEFANMQQQREEDRMAKVQENEEKRLQLQNALTQKLEEMLRQREDL---- :: :. .... ......:: : ....:::::. :. :.:.. ..:.: NP_009 EEEERLQQRLRSEQQLRREQEERREQLLKREEEKRLE-QERREQRLKREQEERRDQLLKR 530 540 550 560 570 580 320 330 340 350 360 370 pF1KB5 -EQVRQELYQEEQAEIYKSKLKEEAEKKLRKQKEMKQDFEEQMALKELVLQAAKEEEENF :. ::. ..:: : ...::.: ..:..... .: .... .: : : ::.. NP_009 EEERRQQRLKREQEERLEQRLKREEVERLEQEERREQRLKREEPEEERRQQLLKSEEQEE 590 600 610 620 630 640 380 390 400 410 420 430 pF1KB5 RKTMLAKFAEDDRIELMNAQKQRMKQLEHRRAVEKLIEERRQQFLADKQRGLEEWQLQQR :. . . ...: : .... ..::.: :. ::::.: ::.... . ....: NP_009 RRQQQLRREQQERREQRLKREEEEERLEQRLKREHE-EERREQELAEEEQEQARERIKSR 650 660 670 680 690 700 440 450 460 470 480 490 pF1KB5 RQGFINAIIEEERLKLLKEHATNLLGYLPKGVFKKEDDIDLLGEEFRKVYQQRSEICEEK NP_009 IPKWQWQLESEADARQSKVYSRPRKQEGQRRRQEQEEKRRRRESELQWQEEERAHRQQQE 710 720 730 740 750 760 >-- initn: 350 init1: 141 opt: 427 Z-score: 176.1 bits: 44.0 E(85289): 0.0046 Smith-Waterman score: 433; 23.8% identity (60.6% similar) in 513 aa overlap (11-487:777-1281) 10 20 30 pF1KB5 MGSKRRNLSCSER-HQKLVDENYCKKLHVQALKNVNSQIR ::: .:.: . .. . . :. . : : NP_009 QWQEEERAHRQQQEEEQRRDFTWQWQAEEKSERGRQRLSARPPLREQRERQLRAEERQQR 750 760 770 780 790 800 40 50 60 70 80 90 pF1KB5 NQMVQNENDNRVQRKQFLRLLQNE-QFELDMEEAIQKAEENKRLKE----LQLKQEEKLA .: :.... ::.. : ..: :: :. :: .:. :. ..:.: :: ::.. . NP_009 EQRFLPEEEEKEQRRRQRREREKELQF-LEEEEQLQRRERAQQLQEEEDGLQEDQERRRS 810 820 830 840 850 860 100 110 120 130 140 150 pF1KB5 MELAKLKHESLKDEKMRQQVREN---SIELRELEKKLKAAYMNKERAAQIAEKDAIKYEQ .: . .. . :. :.. :.. . :.: .: . ...:. : :.. . .. NP_009 QEQRRDQKWRWQLEEERKRRRHTLYAKPALQEQLRKEQQLLQEEEEELQREEREKRRRQE 870 880 890 900 910 920 160 170 180 190 200 pF1KB5 MKRDAEIAKTMMEEHKRIIKEENAAEDKRNKAKAQYYLDL-----EKQL--EEQEKKKQE ..:. . . ...:......:: . .:.. . :: : :.:: :: ::.... NP_009 QERQYREEEQLQQEEEQLLREERE-KRRRQERERQYRKDKKLQQKEEQLLGEEPEKRRRQ 930 940 950 960 970 980 210 220 230 240 250 260 pF1KB5 AYEQLLKEKLMIDEIVRKIYEEDQLEKQQKLEKMNAMRRYIEEFQKEQALWRKKK-REEM :. .:. ... ... .:.. ::... : .:. : :.:. : :... .... NP_009 EREKKYREEEELQQEEEQLLREER-EKRRRQEWERQYRKKDELQQEEEQLLREEREKRRL 990 1000 1010 1020 1030 1040 270 280 290 300 310 pF1KB5 EEENRKIIEFANMQQQRE----EDRMAKVQENEEKRLQLQNALTQKLEEMLRQ----RED .:..:. : ..::..: :.: .. ... :.. . .. : :. :..::. :. NP_009 QERERQYREEEELQQEEEQLLGEERETRRRQELERQYRKEEELQQEEEQLLREEPEKRRR 1050 1060 1070 1080 1090 1100 320 330 340 350 360 370 pF1KB5 LEQVRQELYQEEQAEIYKSKLKEEAEKKLRKQKEMKQDFEEQMALKELVLQAAKEEEENF :. :: .:: . .. :.:: :: :...:......:. ... : .:: :. NP_009 QERERQCREEEELQQEEEQLLREEREK--RRRQELERQYREEEEVQQEEEQLLREEPEKR 1110 1120 1130 1140 1150 1160 380 390 400 410 420 pF1KB5 RKTMLAK-FAEDDRIE-----LMNAQKQRMKQLEHR--RAVEKLIEERRQQFLADK-QRG :. : . . :..... :. .... .: ..: : :.: ...:.: :. ::. NP_009 RRQELERQYREEEELQQEEEQLLREEQEKRRQERERQYREEEELQRQKRKQRYRDEDQRS 1170 1180 1190 1200 1210 1220 430 440 450 460 470 480 pF1KB5 LEEWQLQQRRQGFI--NAIIEEERLKLLKEHATNLLGYLPKGVFKKEDDIDLLGEEFRKV .:: . .... . : . . : . :. .: . ...: ::::. .. NP_009 DLKWQWEPEKENAVRDNKVYCKGREN---EQFRQLEDSQLRDRQSQQDLQHLLGEQQERD 1230 1240 1250 1260 1270 490 pF1KB5 YQQRSEICEEK .: NP_009 REQERRRWQQRDRHFPEEEQLEREEQKEAKRRDRKSQEEKQLLREEREEKRRRQETDRKF 1280 1290 1300 1310 1320 1330 >>NP_036469 (OMIM: 605152) cilia- and flagella-associate (551 aa) initn: 300 init1: 300 opt: 393 Z-score: 171.1 bits: 41.3 E(85289): 0.0086 Smith-Waterman score: 393; 24.8% identity (63.5% similar) in 419 aa overlap (61-459:127-530) 40 50 60 70 80 pF1KB5 LKNVNSQIRNQMVQNENDNRVQRKQFLRLLQNEQFELDMEEAIQKAEENKRL---KELQL ... :. . : ... . :.. ::. NP_036 PSGESLIISPEEFERIKWASHVLTREELEARDQAFKKEKEATMDAVMTRKKIMKQKEMVW 100 110 120 130 140 150 90 100 110 120 130 140 pF1KB5 KQEEKLA--MELAKLKHESL--KDEKMRQQVRENSIELRELEKKLKAAYMNKERAAQIAE ....::. :.:: . ..: . .:.:.. .: ::... : . : . : ::: : NP_036 NNNKKLSDLEEVAKERAQNLLQRANKLRMEQEE---ELKDMSKIILNAKCHAIRDAQILE 160 170 180 190 200 210 150 160 170 180 190 200 pF1KB5 KDAIKYEQMKRDAEIAKTMMEEHKRIIKEENAAEDKRNKAKAQYYLDLEKQLEEQEKKKQ :. :. : .. .. . : :... :.... : :: . . . .. .:.:....... NP_036 KQQIQKELDTEEKRLDQMMEVERQKSIQRQEELERKRREERIRGRRQIVEQMEKNQEERS 220 230 240 250 260 270 210 220 230 240 250 pF1KB5 EAYEQLLKEKLMIDEIVRKIYEED--QLEK--QQKLEKMNAMRRYIEEFQKEQA--LWRK :: .:: .. : .... ::: ..:. ::::. . ..: .: ::..: : .. NP_036 LLAEQREQEKEQMLEYMEQLQEEDLKDMERRQQQKLKMQAEIKRINDENQKQKAELLAQE 280 290 300 310 320 330 260 270 280 290 300 310 pF1KB5 KKREEMEEENRKIIEFANMQQQRE---EDRMAKVQENEEKRLQLQNALTQKLEEMLRQRE : ..: ..::.. .. :: : .. ......::.. :. .: ... ... NP_036 KLADQM------VMEFTKKKMAREAEFEAEQERIRREKEKEIARLRAMQEKAQDYQAEQD 340 350 360 370 380 320 330 340 350 360 370 pF1KB5 DLEQVR-QELYQEEQAEIYKSKLKEEAEKKLRKQKEMKQDFEEQMALKE--LVLQAAKEE :. : ::. ..: .. : ::.:.::.. . :.... ::.:.:: :..:. ... NP_036 ALRAKRNQEVADRE----WRRKEKENARKKMETEAELRKSRLEQVAFKEHALAVQVQRDR 390 400 410 420 430 440 380 390 400 410 420 430 pF1KB5 EENFRKTMLAKFAEDDRIELMNAQKQRMKQLEHRRAVEKLIEERRQQFLADKQRGLEEWQ .: :.. . :. . .. : .. .:. .:.: ... ..: .:. . .. .:: . NP_036 DE-FERILRAQREQIEK-ERLEEEKKATGRLQHANELRRQVRENQQKEVQNRIATFEEGR 450 460 470 480 490 500 440 450 460 470 480 490 pF1KB5 -LQQRRQGFINAIIEEERLKLLKEHATNLLGYLPKGVFKKEDDIDLLGEEFRKVYQQRSE :... : . : : .: :: . .::.: NP_036 RLKEEAQKRRERIDEIKRKKLEELRATGLPEKYCIEAERKANILPATSVN 510 520 530 540 550 495 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 08:21:01 2016 done: Sat Nov 5 08:21:03 2016 Total Scan time: 10.960 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]