FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KSDA0264, 414 aa
  1>>>pF1KSDA0264 414 - 414 aa - 414 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.9890+/-0.000465; mu= 8.9586+/- 0.029
 mean_var=113.5642+/-22.217, 0's: 0 Z-trim(113.2): 26 B-trim: 440 in 1/53
 Lambda= 0.120352
 statistics sampled from 22385 (22400) to 22385 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.627), E-opt: 0.2 (0.263), width: 16
 Scan time: 7.820

The best scores are:                                       opt  bits E(85289)
NP_055899 (OMIM: 611989) 28S ribosomal protein S27 ( 414) 2689 478.2 1.7e-134
NP_001273680 (OMIM: 611989) 28S ribosomal protein  ( 358) 2326 415.1 1.4e-115
NP_001273677 (OMIM: 611989) 28S ribosomal protein  ( 428) 2091 374.4 3.1e-103
XP_005248658 (OMIM: 615484) PREDICTED: pentatricop ( 350)  192  44.6 0.00047
NP_079030 (OMIM: 615484) pentatricopeptide repeat- ( 388)  192  44.6 0.00051

>>NP_055899 (OMIM: 611989) 28S ribosomal protein S27, mi (414 aa)
 initn: 2689 init1: 2689 opt: 2689 Z-score: 2535.8 bits: 478.2 E(85289): 1.7e-134
Smith-Waterman score: 2689; 99.8% identity (99.8% similar) in 414 aa overlap (1-414:1-414)

               10        20        30        40        50        60
pF1KSD MAASIVRRGMLLARQVVLPQLSPAGKRYLLSSAYVDSHKWEAREKEHYCLADLASLMDKT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 MAASIVRRGMLLARQVVLPQLSPAGKRYLLSSAYVDSHKWEAREKEHYCLADLASLMDKT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KSD FERKLPVSSLTISRLIDNISSREEIDHAEYYLYKFRHSPNCWYLRNWTIHTWIRQCLKYD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 FERKLPVSSLTISRLIDNISSREEIDHAEYYLYKFRHSPNCWYLRNWTIHTWIRQCLKYD
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KSD AQDKALYTLVNKVQYGIFPDNFTFNLLMDSFIKKENYKDALSVVFEVMMQEAFEVPSTQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 AQDKALYTLVNKVQYGIFPDNFTFNLLMDSFIKKENYKDALSVVFEVMMQEAFEVPSTQL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KSD LSLYVLFHCLAKKTDFSWEEERNFGASLLLPGLKQKNSVGFSSQLYGYALLGKVELQQGL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 LSLYVLFHCLAKKTDFSWEEERNFGASLLLPGLKQKNSVGFSSQLYGYALLGKVELQQGL
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KSD RAVYHNMPLIWKPGYLDRALQVMEKVAASPEDIKLCREALDVLDAVLKALTSADGASEEQ
       ::::::::::::::::::::::::::::::::::::::::::: ::::::::::::::::
NP_055 RAVYHNMPLIWKPGYLDRALQVMEKVAASPEDIKLCREALDVLGAVLKALTSADGASEEQ
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KSD SQNDEDNQGSEKLVEQLDIEETEQSKLPQYLERFKALHSKLQALGKIESEGLLSLTTQLV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 SQNDEDNQGSEKLVEQLDIEETEQSKLPQYLERFKALHSKLQALGKIESEGLLSLTTQLV
              310       320       330       340       350       360

              370       380       390       400       410
pF1KSD KEKLSTCEAEDIATYEQNLQQWHLDLVQLIQREQQQREQAKQEYQAQKAAKASA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 KEKLSTCEAEDIATYEQNLQQWHLDLVQLIQREQQQREQAKQEYQAQKAAKASA
              370       380       390       400       410

>>NP_001273680 (OMIM: 611989) 28S ribosomal protein S27, (358 aa)
 initn: 2326 init1: 2326 opt: 2326 Z-score: 2196.1 bits: 415.1 E(85289): 1.4e-115
Smith-Waterman score: 2326; 99.7% identity (99.7% similar) in 358 aa overlap (57-414:1-358)

         30        40        50        60        70        80
pF1KSD RYLLSSAYVDSHKWEAREKEHYCLADLASLMDKTFERKLPVSSLTISRLIDNISSREEID
                                     ::::::::::::::::::::::::::::::
NP_001                               MDKTFERKLPVSSLTISRLIDNISSREEID
                                             10        20        30

         90       100       110       120       130       140
pF1KSD HAEYYLYKFRHSPNCWYLRNWTIHTWIRQCLKYDAQDKALYTLVNKVQYGIFPDNFTFNL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 HAEYYLYKFRHSPNCWYLRNWTIHTWIRQCLKYDAQDKALYTLVNKVQYGIFPDNFTFNL
               40        50        60        70        80        90

        150       160       170       180       190       200
pF1KSD LMDSFIKKENYKDALSVVFEVMMQEAFEVPSTQLLSLYVLFHCLAKKTDFSWEEERNFGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 LMDSFIKKENYKDALSVVFEVMMQEAFEVPSTQLLSLYVLFHCLAKKTDFSWEEERNFGA
              100       110       120       130       140       150

        210       220       230       240       250       260
pF1KSD SLLLPGLKQKNSVGFSSQLYGYALLGKVELQQGLRAVYHNMPLIWKPGYLDRALQVMEKV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SLLLPGLKQKNSVGFSSQLYGYALLGKVELQQGLRAVYHNMPLIWKPGYLDRALQVMEKV
              160       170       180       190       200       210

        270       280       290       300       310       320
pF1KSD AASPEDIKLCREALDVLDAVLKALTSADGASEEQSQNDEDNQGSEKLVEQLDIEETEQSK
       ::::::::::::::::: ::::::::::::::::::::::::::::::::::::::::::
NP_001 AASPEDIKLCREALDVLGAVLKALTSADGASEEQSQNDEDNQGSEKLVEQLDIEETEQSK
              220       230       240       250       260       270

        330       340       350       360       370       380
pF1KSD LPQYLERFKALHSKLQALGKIESEGLLSLTTQLVKEKLSTCEAEDIATYEQNLQQWHLDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 LPQYLERFKALHSKLQALGKIESEGLLSLTTQLVKEKLSTCEAEDIATYEQNLQQWHLDL
              280       290       300       310       320       330

        390       400       410
pF1KSD VQLIQREQQQREQAKQEYQAQKAAKASA
       ::::::::::::::::::::::::::::
NP_001 VQLIQREQQQREQAKQEYQAQKAAKASA
              340       350

>>NP_001273677 (OMIM: 611989) 28S ribosomal protein S27, (428 aa)
 initn: 2091 init1: 2091 opt: 2091 Z-score: 1974.4 bits: 374.4 E(85289): 3.1e-103
Smith-Waterman score: 2651; 96.5% identity (96.5% similar) in 428 aa overlap (1-414:1-428)

               10        20        30        40        50        60
pF1KSD MAASIVRRGMLLARQVVLPQLSPAGKRYLLSSAYVDSHKWEAREKEHYCLADLASLMDKT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MAASIVRRGMLLARQVVLPQLSPAGKRYLLSSAYVDSHKWEAREKEHYCLADLASLMDKT
               10        20        30        40        50        60

               70        80        90                     100
pF1KSD FERKLPVSSLTISRLIDNISSREEIDHAEYYLYK--------------FRHSPNCWYLRN
       ::::::::::::::::::::::::::::::::::              ::::::::::::
NP_001 FERKLPVSSLTISRLIDNISSREEIDHAEYYLYKKGECSSSNPQNYSRFRHSPNCWYLRN
               70        80        90       100       110       120

        110       120       130       140       150       160
pF1KSD WTIHTWIRQCLKYDAQDKALYTLVNKVQYGIFPDNFTFNLLMDSFIKKENYKDALSVVFE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 WTIHTWIRQCLKYDAQDKALYTLVNKVQYGIFPDNFTFNLLMDSFIKKENYKDALSVVFE
              130       140       150       160       170       180

        170       180       190       200       210       220
pF1KSD VMMQEAFEVPSTQLLSLYVLFHCLAKKTDFSWEEERNFGASLLLPGLKQKNSVGFSSQLY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 VMMQEAFEVPSTQLLSLYVLFHCLAKKTDFSWEEERNFGASLLLPGLKQKNSVGFSSQLY
              190       200       210       220       230
240 230 240 250 260 270 280 pF1KSD GYALLGKVELQQGLRAVYHNMPLIWKPGYLDRALQVMEKVAASPEDIKLCREALDVLDAV ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: :: NP_001 GYALLGKVELQQGLRAVYHNMPLIWKPGYLDRALQVMEKVAASPEDIKLCREALDVLGAV 250 260 270 280 290 300 290 300 310 320 330 340 pF1KSD LKALTSADGASEEQSQNDEDNQGSEKLVEQLDIEETEQSKLPQYLERFKALHSKLQALGK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LKALTSADGASEEQSQNDEDNQGSEKLVEQLDIEETEQSKLPQYLERFKALHSKLQALGK 310 320 330 340 350 360 350 360 370 380 390 400 pF1KSD IESEGLLSLTTQLVKEKLSTCEAEDIATYEQNLQQWHLDLVQLIQREQQQREQAKQEYQA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 IESEGLLSLTTQLVKEKLSTCEAEDIATYEQNLQQWHLDLVQLIQREQQQREQAKQEYQA 370 380 390 400 410 420 410 pF1KSD QKAAKASA :::::::: NP_001 QKAAKASA >>XP_005248658 (OMIM: 615484) PREDICTED: pentatricopepti (350 aa) initn: 103 init1: 78 opt: 192 Z-score: 193.8 bits: 44.6 E(85289): 0.00047 Smith-Waterman score: 200; 21.9% identity (56.6% similar) in 288 aa overlap (23-281:40-322) 10 20 30 40 50 pF1KSD MAASIVRRGMLLARQVVLPQLSPAG-KRYLLSSAYVDSHKWEAREKEHYC-L : : :::::.. : .... .. : : XP_005 FRPSNRVLLQALQILVYPGVGGSGSVSCRCPLGAKRYLLTDNVVKLKEFQQKKVAVACNL 10 20 30 40 50 60 60 70 80 90 100 pF1KSD ADLASLMDKTFERKLPVSSLTIS----RLIDNISSREEIDHAEYYLYKFRHSPNCWYLRN . . .....:: ..: .. :. ::.... :. .:.. :. : .: XP_005 SGTKETYFRNLKKKLTQNKLILKGELITLLHLCESRDHVELAKNVIYRY-HAEN----KN 70 80 90 100 110 120 110 120 130 140 150 160 pF1KSD WTIHTW------IRQCLKYDAQDKALYTLVNKVQYGIFPDNFTFNLLMDSFIKKENYKDA .:. . .: : . : ...:. . .. :.: :. .::.::: .. : .::.: XP_005 FTLGEYKFGPLFVRLCYELDLEESAVELMKDQHLRGFFSDSTSFNILMDMLFIKGKYKSA 130 140 150 160 170 180 170 180 190 200 pF1KSD LSVVFEVMMQEAFEVPSTQLLSLYVLFH---------CLAKKTDFSWEEE------RNFG :.:..:. :.. . .: .:.. . .. : . . . . : :. XP_005 LQVLIEMKNQDVKFTKDTYVLAFAICYKLNSPESFKICTTLREEALLKGEILSRRASCFA 190 200 210 220 230 240 210 220 230 240 250 260 pF1KSD ASLLLPGLKQKNSVGFSSQLYGYALLGKVELQQG-LRAVYHNMP-LIWKPGYLDRALQVM ..: : .. ..:.. ::... .. ..:. . 
.: ...: :. : . .:.. XP_005 VALALNQNEMAKAVSIFSQIMNPESIACINLNLAKVREKVKDVPALVAKFDEIYGTLHIT 250 260 270 280 290 300 270 280 290 300 310 320 pF1KSD EKVAASPEDIKLCREALDVLDAVLKALTSADGASEEQSQNDEDNQGSEKLVEQLDIEETE .:... : ::. : XP_005 GQVTTDSLDAVLCHTPRDRKSHTLLLNKRMVSRRTFQPLSQSLLAE 310 320 330 340 350 >>NP_079030 (OMIM: 615484) pentatricopeptide repeat-cont (388 aa) initn: 103 init1: 78 opt: 192 Z-score: 193.1 bits: 44.6 E(85289): 0.00051 Smith-Waterman score: 194; 21.2% identity (55.9% similar) in 358 aa overlap (23-365:40-363) 10 20 30 40 50 pF1KSD MAASIVRRGMLLARQVVLPQLSPAG-KRYLLSSAYVDSHKWEAREKEHYC-L : : :::::.. : .... .. : : NP_079 FRPSNRVLLQALQILVYPGVGGSGSVSCRCPLGAKRYLLTDNVVKLKEFQQKKVAVACNL 10 20 30 40 50 60 60 70 80 90 100 pF1KSD ADLASLMDKTFERKLPVSSLTIS----RLIDNISSREEIDHAEYYLYKFRHSPNCWYLRN . . .....:: ..: .. :. ::.... :. .:.. :. : .: NP_079 SGTKETYFRNLKKKLTQNKLILKGELITLLHLCESRDHVELAKNVIYRY-HAEN----KN 70 80 90 100 110 120 110 120 130 140 150 160 pF1KSD WTIHTW------IRQCLKYDAQDKALYTLVNKVQYGIFPDNFTFNLLMDSFIKKENYKDA .:. . .: : . : ...:. . .. :.: :. .::.::: .. : .::.: NP_079 FTLGEYKFGPLFVRLCYELDLEESAVELMKDQHLRGFFSDSTSFNILMDMLFIKGKYKSA 130 140 150 160 170 180 170 180 190 200 210 pF1KSD LSVVFEVMMQEAFEVPSTQLLSLYVLFHCLAKKTDFSWEEERNFGASLLLPG--LKQKNS :.:..:. :.. . .: .:.. . : .. :.. .. :: : :... : NP_079 LQVLIEMKNQDVKFTKDTYVLAFAI---CYKLNSPESFKICTTLREEALLKGEILSRRAS 190 200 210 220 230 240 220 230 240 250 260 270 pF1KSD VGFSSQLYGYALLGKVELQQGLRAVYHNMPLIWKPGYLDRALQVMEKVAASPEDIKLCRE :. : :.. :. ... ... . : .: :..: .: . . NP_079 C-FAVALA----LNQNEMAKAV-SIFSQ---IMNP----------ESIACINLNIIIHIQ 250 260 270 280 280 290 300 310 320 330 pF1KSD ALDVLDAVLKALTSADGASEEQSQNDEDNQGSEKLVEQLDIEETEQSK-LPQYLERFKAL . ..:. ..:.: .: : . :. . . ::... .. :. : .: . .: . NP_079 S-NMLENLIKTLKNA--AEGNLSKFVKRHVFSEEVLAKV----REKVKDVPALVAKFDEI 290 300 310 320 330 340 350 360 370 380 390 pF1KSD HSKLQALGKIESEGLLSLTTQLVKEKLSTCEAEDIATYEQNLQQWHLDLVQLIQREQQQR .. :. :.. ...: .. . ... 
:
NP_079 YGTLHITGQVTTDSLDAVLCHTPRDRKSHTLLLNKRMVSRRTFQPLSQSLLAE
         340       350       360       370       380

414 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 00:59:17 2016 done: Thu Nov 3 00:59:18 2016
 Total Scan time: 7.820 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]