FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KA1159, 252 aa 1>>>pF1KA1159 252 - 252 aa - 252 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.0256+/-0.000692; mu= 7.8388+/- 0.042 mean_var=126.7577+/-25.170, 0's: 0 Z-trim(114.9): 8 B-trim: 0 in 0/50 Lambda= 0.113917 statistics sampled from 15478 (15485) to 15478 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.804), E-opt: 0.2 (0.476), width: 16 Scan time: 2.600 The best scores are: opt bits E(32554) CCDS11550.1 NXPH3 gene_id:11248|Hs108|chr17 ( 252) 1794 305.0 3.1e-83 CCDS47540.1 NXPH1 gene_id:30010|Hs108|chr7 ( 271) 847 149.4 2.3e-36 CCDS46421.1 NXPH2 gene_id:11249|Hs108|chr2 ( 264) 815 144.1 8.8e-35 CCDS8933.1 NXPH4 gene_id:11247|Hs108|chr12 ( 308) 375 71.9 5.8e-13 >>CCDS11550.1 NXPH3 gene_id:11248|Hs108|chr17 (252 aa) initn: 1794 init1: 1794 opt: 1794 Z-score: 1607.8 bits: 305.0 E(32554): 3.1e-83 Smith-Waterman score: 1794; 100.0% identity (100.0% similar) in 252 aa overlap (1-252:1-252) 10 20 30 40 50 60 pF1KA1 MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPM 10 20 30 40 50 60 70 80 90 100 110 120 pF1KA1 ANSTLLGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 ANSTLLGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KA1 KIVDHGNGTFSVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 KIVDHGNGTFSVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KA1 KVERGRRTSLCTHDPAKICSRDHAQSSATWSCSQPFKVVCVYIAFYSTDYRLVQKVCPDY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 KVERGRRTSLCTHDPAKICSRDHAQSSATWSCSQPFKVVCVYIAFYSTDYRLVQKVCPDY 190 200 210 220 230 240 250 pF1KA1 NYHSDTPYYPSG :::::::::::: CCDS11 NYHSDTPYYPSG 250 >>CCDS47540.1 NXPH1 gene_id:30010|Hs108|chr7 (271 aa) initn: 906 init1: 838 opt: 847 Z-score: 766.2 bits: 149.4 E(32554): 2.3e-36 Smith-Waterman score: 847; 65.1% identity (86.9% similar) in 175 aa overlap (79-252:97-271) 50 60 70 80 90 100 pF1KA1 KRGHISPKSRPMANSTLLGLLAPPGEAWGILGQPPNRPNHSPP-PSAKVKKIFGWGDFYS : .: : .. : ..: ::.::::::.: CCDS47 ENDTDLDLRYDTPEPYSEQDLWDWLRNSTDLQEPRPRAKRRPIVKTGKFKKMFGWGDFHS 70 80 90 100 110 120 110 120 130 140 150 160 pF1KA1 NIKTVALNLLVTGKIVDHGNGTFSVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEA ::::: ::::.::::::::::::::.:.::.:::::.:.:::::.: ::: :: :.: CCDS47 NIKTVKLNLLITGKIVDHGNGTFSVYFRHNSTGQGNVSVSLVPPTKIVEFDLAQQTVIDA 130 140 150 160 170 180 170 180 190 200 210 220 pF1KA1 KASKIFNCRMEWEKVERGRRTSLCTHDPAKICSRDHAQSSATWSCSQPFKVVCVYIAFYS : :: ::::.:.:::... ...::..::.: : ....:: ..: ::.::::.:.::.::: CCDS47 KDSKSFNCRIEYEKVDKATKNTLCNYDPSKTCYQEQTQSHVSWLCSKPFKVICIYISFYS 190 200 210 220 230 240 230 240 250 pF1KA1 TDYRLVQKVCPDYNYHSDTPYYPSG :::.:::::::::::::::::.::: CCDS47 TDYKLVQKVCPDYNYHSDTPYFPSG 250 260 270 >>CCDS46421.1 NXPH2 gene_id:11249|Hs108|chr2 (264 aa) initn: 826 init1: 806 opt: 815 Z-score: 737.9 bits: 144.1 E(32554): 8.8e-35 Smith-Waterman score: 822; 50.8% identity (72.3% similar) in 256 aa overlap (11-252:10-264) 10 20 30 40 50 pF1KA1 MQLTRCCFVFLVQGSLYLVICGQDDGPPGSE--DPERDDHEGQPRPRVPRKRGHISP--- .: : : :..: . . ..: : : : : : ..: ::: CCDS46 MRLRPLPLVVVPGLLQLLFCDSKEVVHATEGLDWEDKDAPGTLVGNVVHSRI-ISPLRL 10 20 30 40 50 60 70 80 90 100 pF1KA1 --KSRPMANSTLLGLLAPPGEAWGILG------QPPNRPNHSPP-PSAKVKKIFGWGDFY :. :. . .. . : :. .: : .. : ..: ::.::::::. CCDS46 FVKQSPVPKPGPMAYADSMENFWDWLANITEIQEPLARTKRRPIVKTGKFKKMFGWGDFH 60 70 80 90 100 110 110 120 130 140 150 160 pF1KA1 SNIKTVALNLLVTGKIVDHGNGTFSVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIE :::::: ::::.::::::::::::::.:.::.:: ::.:.:::::::.:::. : .: CCDS46 SNIKTVKLNLLITGKIVDHGNGTFSVYFRHNSTGLGNVSVSLVPPSKVVEFEVSPQSTLE 120 130 140 150 160 170 170 180 190 200 210 220 pF1KA1 AKASKIFNCRMEWEKVERGRRTSLCTHDPAKICSRDHAQSSATWSCSQPFKVVCVYIAFY .: :: ::::.:.::..:...:.::. ::.::: ....:: ..: ::.::::.:.::::: CCDS46 TKESKSFNCRIEYEKTDRAKKTALCNFDPSKICYQEQTQSHVSWLCSKPFKVICIYIAFY 180 190 200 210 220 230 230 240 250 pF1KA1 STDYRLVQKVCPDYNYHSDTPYYPSG :.::.:::::::::::::.::: :: CCDS46 SVDYKLVQKVCPDYNYHSETPYLSSG 240 250 260 >>CCDS8933.1 NXPH4 gene_id:11247|Hs108|chr12 (308 aa) initn: 693 init1: 375 opt: 375 Z-score: 346.2 bits: 71.9 E(32554): 5.8e-13 Smith-Waterman score: 598; 39.7% identity (59.9% similar) in 277 aa overlap (26-249:44-307) 10 20 30 40 50 pF1KA1 MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISP : ::.. :: :: :. CCDS89 PWLLRKAVSAQIPESGRPQYLGLRPAAAGAGAPGQQLPE-------PRSSDGLGVGRAWS 20 30 40 50 60 60 70 80 90 100 110 pF1KA1 KSRPMANSTLLGLLAPPGEAWGILGQPPNRPNHSPP-PSAKVKKIFGWGDFYSNIKTVAL . : .: : : :: : : : : : .: ...: .:..:::::::::: ..:. . CCDS89 WAWP-TNHT--GALARAGAA-GAL--PAQRTKRKPSIKAARAKKIFGWGDFYFRVHTLKF 70 80 90 100 110 120 120 130 140 150 160 pF1KA1 NLLVTGKIVDHGNGTFSVHFQHNATGQGNISISLVPPSKAVEF----------HQEQQIF .:::::::::: ::::::.:.::... ::.:.:.::::: ::: : :. . CCDS89 SLLVTGKIVDHVNGTFSVYFRHNSSSLGNLSVSIVPPSKRVEFGGVWLPGPVPHPLQSTL 130 140 150 160 170 180 170 180 pF1KA1 -IE-----------------------------------------AKASKIFNCRMEWEKV .: :: :. :::..:.::. CCDS89 ALEGVLPGLGPPLGMAAAAAGPGLGGSLGGALAGPLGGALGVPGAKESRAFNCHVEYEKT 190 200 210 220 230 240 190 200 210 220 230 240 pF1KA1 ERGRRTSLCTHDPAKICSRDHAQSSATWSCSQPFKVVCVYIAFYSTDYRLVQKVCPDYNY .:.:. : .::...: .:.::.:.: :..::::.:....: : ::.::::::::::. CCDS89 NRARKHRPCLYDPSQVCFTEHTQSQAAWLCAKPFKVICIFVSFLSFDYKLVQKVCPDYNF 250 260 270 280 290 300 250 pF1KA1 HSDTPYYPSG .:. ::. CCDS89 QSEHPYFG 252 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Nov 3 19:54:01 2016 done: Thu Nov 3 19:54:01 2016 Total Scan time: 2.600 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]