FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB4126, 433 aa
  1>>>pF1KB4126 433 - 433 aa - 433 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.6347+/-0.00119; mu= 15.9945+/- 0.070
 mean_var=100.7416+/-26.354, 0's: 0 Z-trim(102.1): 148 B-trim: 827 in 2/48
 Lambda= 0.127782
 statistics sampled from 6587 (6784) to 6587 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.551), E-opt: 0.2 (0.208), width: 16
 Scan time: 2.800

The best scores are:                                       opt  bits E(32554)
CCDS5744.1 GPR22 gene_id:2845|Hs108|chr7           ( 433) 2782 524.2 9.5e-149
CCDS3438.1 CCKAR gene_id:886|Hs108|chr4            ( 428)  339  73.8 3.6e-13

>>CCDS5744.1 GPR22 gene_id:2845|Hs108|chr7               (433 aa)
 initn: 2782 init1: 2782 opt: 2782 Z-score: 2784.0 bits: 524.2 E(32554): 9.5e-149
Smith-Waterman score: 2782; 100.0% identity (100.0% similar) in 433 aa overlap (1-433:1-433)

               10        20        30        40        50        60
pF1KB4 MCFSPILEINMQSESNITVRDDIDDINTNMYQPLSYPLSFQVSLTGFLMLEIVLGLGSNL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 MCFSPILEINMQSESNITVRDDIDDINTNMYQPLSYPLSFQVSLTGFLMLEIVLGLGSNL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB4 TVLVLYCMKSNLINSVSNIITMNLHVLDVIICVGCIPLTIVILLLSLESNTALICCFHEA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 TVLVLYCMKSNLINSVSNIITMNLHVLDVIICVGCIPLTIVILLLSLESNTALICCFHEA
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB4 CVSFASVSTAINVFAITLDRYDISVKPANRILTMGRAVMLMISIWIFSFFSFLIPFIEVN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 CVSFASVSTAINVFAITLDRYDISVKPANRILTMGRAVMLMISIWIFSFFSFLIPFIEVN
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB4 FFSLQSGNTWENKTLLCVSTNEYYTELGMYYHLLVQIPIFFFTVVVMLITYTKILQALNI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 FFSLQSGNTWENKTLLCVSTNEYYTELGMYYHLLVQIPIFFFTVVVMLITYTKILQALNI
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB4 RIGTRFSTGQKKKARKKKTISLTTQHEATDMSQSSGGRNVVFGVRTSVSVIIALRRAVKR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 RIGTRFSTGQKKKARKKKTISLTTQHEATDMSQSSGGRNVVFGVRTSVSVIIALRRAVKR
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB4 HRERRERQKRVFRMSLLIISTFLLCWTPISVLNTTILCLGPSDLLVKLRLCFLVMAYGTT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 HRERRERQKRVFRMSLLIISTFLLCWTPISVLNTTILCLGPSDLLVKLRLCFLVMAYGTT
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB4 IFHPLLYAFTRQKFQKVLKSKMKKRVVSIVEADPLPNNAVIHNSWIDPKRNKKITFEDSE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 IFHPLLYAFTRQKFQKVLKSKMKKRVVSIVEADPLPNNAVIHNSWIDPKRNKKITFEDSE
              370       380       390       400       410       420

              430
pF1KB4 IREKCLVPQVVTD
       :::::::::::::
CCDS57 IREKCLVPQVVTD
              430

>>CCDS3438.1 CCKAR gene_id:886|Hs108|chr4                (428 aa)
 initn: 168 init1: 96 opt: 339 Z-score: 350.0 bits: 73.8 E(32554): 3.6e-13
Smith-Waterman score: 339; 23.9% identity (59.5% similar) in 348 aa overlap (41-375:44-378)

               20        30        40        50        60        70
pF1KB4 MQSESNITVRDDIDDINTNMYQPLSYPLSFQVSLTGFLMLEIVLGLGSNLTVLVLYCMKS
                                     :. : ....:  :::  ..:.. ::  ...
CCDS34 ITPPCELGLENETLFCLDQPRPSKEWQPAVQILLYSLIFLLSVLG--NTLVITVL--IRN
            20        30        40        50          60

               80        90       100       110       120       130
pF1KB4 NLINSVSNIITMNLHVLDVIICVGCIPLTIVILLLSLESNTALICCFHEACVSFASVSTA
       . . .:.::. ..: : :...:. :.:....  ::.     . .:   .. . : ..:..
CCDS34 KRMRTVTNIFLLSLAVSDLMLCLFCMPFNLIPNLLKDFIFGSAVC---KTTTYFMGTSVS
      70        80        90       100       110          120

                 140        150        160       170        180
pF1KB4 INVF---AITLDRYDISVKP-ANRIL-TMGRAVMLMISIWIFSFFSFLIPF-IEVNFFSL
       ...:   ::.:.::    ::  .:.  : ..:. .. . : .:: ... :. :  :.  .
CCDS34 VSTFNLVAISLERYGAICKPLQSRVWQTKSHALKVIAATWCLSF-TIMTPYPIYSNLVPF
        130       140       150       160       170        180

          190       200       210       220       230       240
pF1KB4 QSGNTWENKTLLCVSTNEYYTELGMYYHLLVQIPIFFFTVVVMLITYTKILQALNIRIGT
        ..:.   .    .  :.    . . .: .. . .:..  .::...:  :  .:..  :
CCDS34 TKNNNQTANMCRFLLPNDV---MQQSWHTFLLLILFLIPGIVMMVAYGLI--SLELYQGI
         190       200          210       220       230         240

          250       260        270          280       290       300
pF1KB4 RFSTGQKKKARKKKTISLTT-QHEATD---MSQSSGGRNVVFGVRTSVSVIIALRRAVKR
       .: ..:::.:...:  . .. ..: .:   ....   :.. .   .. :   : :   .
CCDS34 KFEASQKKSAKERKPSTTSSGKYEDSDGCYLQKTRPPRKLELRQLSTGSSSRANRIRSNS
              250       260       270       280       290       300

              310       320       330         340       350
pF1KB4 HRERRERQKRVFRMSLLIISTFLLCWTPISVLNT--TILCLGPSDLLVKLRLCF-LVMAY
              .:::.:: ..:.  :.::: ::   :.  .    .    :    . : :...:
CCDS34 SAANLMAKKRVIRMLIVIVVLFFLCWMPIFSANAWRAYDTASAERRLSGTPISFILLLSY
              310       320       330       340       350       360

       360       370       380       390       400       410
pF1KB4 GTTIFHPLLYAFTRQKFQKVLKSKMKKRVVSIVEADPLPNNAVIHNSWIDPKRNKKITFE
        ..  .:..: :  ..:.
CCDS34 TSSCVNPIIYCFMNKRFRLGFMATFPCCPNPGPPGARGEVGEEEEGGTTGASLSRFSYSH
              370       380       390       400       410       420

433 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 14:27:57 2016 done: Thu Nov 3 14:27:57 2016
 Total Scan time: 2.800 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]