FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB3313, 553 aa
  1>>>pF1KB3313 553 - 553 aa - 553 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.9843+/-0.00104; mu= 12.2504+/- 0.062
 mean_var=154.8290+/-30.913, 0's: 0 Z-trim(108.9): 33 B-trim: 250 in 1/52
 Lambda= 0.103074
 statistics sampled from 10470 (10496) to 10470 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.683), E-opt: 0.2 (0.322), width: 16
 Scan time: 3.520

The best scores are:                                opt   bits  E(32554)
CCDS10524.1 GLYR1 gene_id:84656|Hs108|chr16 ( 553) 3668  557.8  1.3e-158
CCDS81945.1 GLYR1 gene_id:84656|Hs108|chr16 ( 547) 3600  547.6  1.4e-155
CCDS5414.1 HIBADH gene_id:11112|Hs108|chr7  ( 336)  381   68.8  1.2e-11

>>CCDS10524.1 GLYR1 gene_id:84656|Hs108|chr16 (553 aa)
 initn: 3668 init1: 3668 opt: 3668 Z-score: 2961.4 bits: 557.8 E(32554): 1.3e-158
Smith-Waterman score: 3668; 100.0% identity (100.0% similar) in 553 aa overlap (1-553:1-553)

               10        20        30        40        50        60
pF1KB3 MAAVSLRLGDLVWGKLGRYPPWPGKIVNPPKDLKKPRGKKCFFVKFFGTEDHAWIKVEQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MAAVSLRLGDLVWGKLGRYPPWPGKIVNPPKDLKKPRGKKCFFVKFFGTEDHAWIKVEQL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB3 KPYHAHKEEMIKINKGKRFQQAVDAVEEFLRRAKGKDQTSSHNSSDDKNRRNSSEERSRP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 KPYHAHKEEMIKINKGKRFQQAVDAVEEFLRRAKGKDQTSSHNSSDDKNRRNSSEERSRP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB3 NSGDEKRKLSLSEGKVKKNMGEGKKRVSSGSSERGSKSPLKRAQEQSPRKRGRPPKDEKD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 NSGDEKRKLSLSEGKVKKNMGEGKKRVSSGSSERGSKSPLKRAQEQSPRKRGRPPKDEKD
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB3 LTIPESSTVKGMMAGPMAAFKWQPTASEPVKDADPHFHHFLLSQTEKPAVCYQAITKKLK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LTIPESSTVKGMMAGPMAAFKWQPTASEPVKDADPHFHHFLLSQTEKPAVCYQAITKKLK
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB3 ICEEETGSTSIQAADSTAVNGSITPTDKKIGFLGLGLMGSGIVSNLLKMGHTVTVWNRTA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 ICEEETGSTSIQAADSTAVNGSITPTDKKIGFLGLGLMGSGIVSNLLKMGHTVTVWNRTA
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB3 EKCDLFIQEGARLGRTPAEVVSTCDITFACVSDPKAAKDLVLGPSGVLQGIRPGKCYVDM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 EKCDLFIQEGARLGRTPAEVVSTCDITFACVSDPKAAKDLVLGPSGVLQGIRPGKCYVDM
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB3 STVDADTVTELAQVIVSRGGRFLEAPVSGNQQLSNDGMLVILAAGDRGLYEDCSSCFQAM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 STVDADTVTELAQVIVSRGGRFLEAPVSGNQQLSNDGMLVILAAGDRGLYEDCSSCFQAM
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB3 GKTSFFLGEVGNAAKMMLIVNMVQGSFMATIAEGLTLAQVTGQSQQTLLDILNQGQLASI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 GKTSFFLGEVGNAAKMMLIVNMVQGSFMATIAEGLTLAQVTGQSQQTLLDILNQGQLASI
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KB3 FLDQKCQNILQGNFKPDFYLKYIQKDLRLAIALGDAVNHPTPMAAAANEVYKRAKALDQS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 FLDQKCQNILQGNFKPDFYLKYIQKDLRLAIALGDAVNHPTPMAAAANEVYKRAKALDQS
              490       500       510       520       530       540

              550
pF1KB3 DNDMSAVYRAYIH
       :::::::::::::
CCDS10 DNDMSAVYRAYIH
              550

>>CCDS81945.1 GLYR1 gene_id:84656|Hs108|chr16 (547 aa)
 initn: 2028 init1: 2028 opt: 3600 Z-score: 2906.8 bits: 547.6 E(32554): 1.4e-155
Smith-Waterman score: 3600; 98.9% identity (98.9% similar) in 553 aa overlap (1-553:1-547)

               10        20        30        40        50        60
pF1KB3 MAAVSLRLGDLVWGKLGRYPPWPGKIVNPPKDLKKPRGKKCFFVKFFGTEDHAWIKVEQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 MAAVSLRLGDLVWGKLGRYPPWPGKIVNPPKDLKKPRGKKCFFVKFFGTEDHAWIKVEQL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB3 KPYHAHKEEMIKINKGKRFQQAVDAVEEFLRRAKGKDQTSSHNSSDDKNRRNSSEERSRP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 KPYHAHKEEMIKINKGKRFQQAVDAVEEFLRRAKGKDQTSSHNSSDDKNRRNSSEERSRP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB3 NSGDEKRKLSLSEGKVKKNMGEGKKRVSSGSSERGSKSPLKRAQEQSPRKRGRPPKDEKD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 NSGDEKRKLSLSEGKVKKNMGEGKKRVSSGSSERGSKSPLKRAQEQSPRKRGRPPKDEKD
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB3 LTIPESSTVKGMMAGPMAAFKWQPTASEPVKDADPHFHHFLLSQTEKPAVCYQAITKKLK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 LTIPESSTVKGMMAGPMAAFKWQPTASEPVKDADPHFHHFLLSQTEKPAVCYQAITKKLK
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB3 ICEEETGSTSIQAADSTAVNGSITPTDKKIGFLGLGLMGSGIVSNLLKMGHTVTVWNRTA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 ICEEETGSTSIQAADSTAVNGSITPTDKKIGFLGLGLMGSGIVSNLLKMGHTVTVWNRTA
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB3 EKCDLFIQEGARLGRTPAEVVSTCDITFACVSDPKAAKDLVLGPSGVLQGIRPGKCYVDM
       ::      ::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 EK------EGARLGRTPAEVVSTCDITFACVSDPKAAKDLVLGPSGVLQGIRPGKCYVDM
                    310       320       330       340       350

              370       380       390       400       410       420
pF1KB3 STVDADTVTELAQVIVSRGGRFLEAPVSGNQQLSNDGMLVILAAGDRGLYEDCSSCFQAM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 STVDADTVTELAQVIVSRGGRFLEAPVSGNQQLSNDGMLVILAAGDRGLYEDCSSCFQAM
          360       370       380       390       400       410

              430       440       450       460       470       480
pF1KB3 GKTSFFLGEVGNAAKMMLIVNMVQGSFMATIAEGLTLAQVTGQSQQTLLDILNQGQLASI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 GKTSFFLGEVGNAAKMMLIVNMVQGSFMATIAEGLTLAQVTGQSQQTLLDILNQGQLASI
          420       430       440       450       460       470

              490       500       510       520       530       540
pF1KB3 FLDQKCQNILQGNFKPDFYLKYIQKDLRLAIALGDAVNHPTPMAAAANEVYKRAKALDQS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 FLDQKCQNILQGNFKPDFYLKYIQKDLRLAIALGDAVNHPTPMAAAANEVYKRAKALDQS
          480       490       500       510       520       530

              550
pF1KB3 DNDMSAVYRAYIH
       :::::::::::::
CCDS81 DNDMSAVYRAYIH
          540

>>CCDS5414.1 HIBADH gene_id:11112|Hs108|chr7 (336 aa)
 initn: 324 init1: 324 opt: 381 Z-score: 322.5 bits: 68.8 E(32554): 1.2e-11
Smith-Waterman
score: 381; 25.3% identity (56.9% similar) in 304 aa overlap (253-549:25-328) 230 240 250 260 270 280 pF1KB3 SQTEKPAVCYQAITKKLKICEEETGSTSIQAADSTAVNGSITPTDKKIGFLGLGLMGSGI :.. .:: . . . .::.::: ::. . CCDS54 MAASLRLLGAASGLRYWSRRLRPAAGSFAAVCSRSVASKTPVGFIGLGNMGNPM 10 20 30 40 50 290 300 310 320 330 340 pF1KB3 VSNLLKMGHTVTVWNRTAEKCDLFIQEGARLGRTPAEVVSTCDITFACVSDPKAAKDLVL ..::.: :. . ... . : : . : .. .::.:. : .. . : . CCDS54 AKNLMKHGYPLIIYDVFPDACKEFQDAGEQVVSSPADVAEKADRIITMLPTSINAIEAYS 60 70 80 90 100 110 350 360 370 380 390 400 pF1KB3 GPSGVLQGIRPGKCYVDMSTVDADTVTELAQVIVSRGGRFLEAPVSGNQQLSNDGMLVIL : .:.:. .. :. .: ::.: . :::. . . :. :..:::::. . .: :... CCDS54 GANGILKKVKKGSLLIDSSTIDPAVSKELAKEVEKMGAVFMDAPVSGGVGAARSGNLTFM 120 130 140 150 160 170 410 420 430 440 450 460 pF1KB3 AAGDRGLYEDCSSCFQAMGKTSFFLGEVGNAAKMMLIVNMVQGSFMATIAEGLTLAQVTG ..: . . . . ::.. . : ::.. . ::. . : ::...:. : CCDS54 VGGVEDEFAAAQELLGCMGSNVVYCGAVGTGQAAKICNNMLLAISMIGTAEAMNLGIRLG 180 190 200 210 220 230 470 480 490 500 510 pF1KB3 QSQQTLLDILNQ--GQLASIFLDQKCQNILQG-----NFKPDFYLKYIQKDLRLAIALGD . . : :::. :. : . ....: :.. : . ::: :: . CCDS54 LDPKLLAKILNMSSGRCWSSDTYNPVPGVMDGVPSANNYQGGFGTTLMAKDLGLAQDSAT 240 250 260 270 280 290 520 530 540 550 pF1KB3 AVNHPTPMAAAANEVYKRAKALDQSDNDMSAVYRAYIH ... : ... :...:. : : .:.:.:.. CCDS54 STKSPILLGSLAHQIYRMMCAKGYSKKDFSSVFQFLREEETF 300 310 320 330 553 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 04:54:48 2016 done: Sat Nov 5 04:54:49 2016 Total Scan time: 3.520 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]