FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5884, 496 aa
  1>>>pF1KB5884 496 - 496 aa - 496 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.5479+/-0.000928; mu= 5.3656+/- 0.056
 mean_var=263.1873+/-54.220, 0's: 0 Z-trim(114.4): 11 B-trim: 245 in 2/50
 Lambda= 0.079057
 statistics sampled from 14972 (14981) to 14972 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.778), E-opt: 0.2 (0.46), width: 16
 Scan time: 3.440

The best scores are:                                opt  bits E(32554)
CCDS2553.1 NEU4 gene_id:129807|Hs108|chr2   ( 496) 3527 415.5 6.8e-116
CCDS54441.1 NEU4 gene_id:129807|Hs108|chr2  ( 497) 3515 414.1 1.8e-115
CCDS54442.1 NEU4 gene_id:129807|Hs108|chr2  ( 484) 3443 405.9 5.1e-113
CCDS44682.1 NEU3 gene_id:10825|Hs108|chr11  ( 461)  957 122.3 1.1e-27
CCDS2501.1 NEU2 gene_id:4759|Hs108|chr2     ( 380)  809 105.3 1.2e-22

>>CCDS2553.1 NEU4 gene_id:129807|Hs108|chr2 (496 aa)
 initn: 3527 init1: 3527 opt: 3527 Z-score: 2194.1 bits: 415.5 E(32554): 6.8e-116
Smith-Waterman score: 3527; 100.0% identity (100.0% similar) in 496 aa overlap (1-496:1-496)

               10        20        30        40        50        60
pF1KB5 MMSSAAFPRWLSMGVPRTPSRTVLFERERTGLTYRVPSLLPVPPGPTLLAFVEQRLSPDD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 MMSSAAFPRWLSMGVPRTPSRTVLFERERTGLTYRVPSLLPVPPGPTLLAFVEQRLSPDD
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 SHAHRLVLRRGTLAGGSVRWGALHVLGTAALAEHRSMNPCPVHDAGTGTVFLFFIAVLGH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 SHAHRLVLRRGTLAGGSVRWGALHVLGTAALAEHRSMNPCPVHDAGTGTVFLFFIAVLGH
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 TPEAVQIATGRNAARLCCVASRDAGLSWGSARDLTEEAIGGAVQDWATFAVGPGHGVQLP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 TPEAVQIATGRNAARLCCVASRDAGLSWGSARDLTEEAIGGAVQDWATFAVGPGHGVQLP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB5 SGRLLVPAYTYRVDRRECFGKICRTSPHSFAFYSDDHGRTWRCGGLVPNLRSGECQLAAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 SGRLLVPAYTYRVDRRECFGKICRTSPHSFAFYSDDHGRTWRCGGLVPNLRSGECQLAAV
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB5 DGGQAGSFLYCNARSPLGSRVQALSTDEGTSFLPAERVASLPETAWGCQGSIVGFPAPAP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 DGGQAGSFLYCNARSPLGSRVQALSTDEGTSFLPAERVASLPETAWGCQGSIVGFPAPAP
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB5 NRPRDDSWSVGPGSPLQPPLLGPGVHEPPEEAAVDPRGGQVPGGPFSRLQPRGDGPRQPG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 NRPRDDSWSVGPGSPLQPPLLGPGVHEPPEEAAVDPRGGQVPGGPFSRLQPRGDGPRQPG
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB5 PRPGVSGDVGSWTLALPMPFAAPPQSPTWLLYSHPVGRRARLHMGIRLSQSPLDPRSWTE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 PRPGVSGDVGSWTLALPMPFAAPPQSPTWLLYSHPVGRRARLHMGIRLSQSPLDPRSWTE
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB5 PWVIYEGPSGYSDLASIGPAPEGGLVFACLYESGARTSYDEISFCTFSLREVLENVPASP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 PWVIYEGPSGYSDLASIGPAPEGGLVFACLYESGARTSYDEISFCTFSLREVLENVPASP
              430       440       450       460       470       480

              490
pF1KB5 KPPNLGDKPRGCCWPS
       ::::::::::::::::
CCDS25 KPPNLGDKPRGCCWPS
              490

>>CCDS54441.1 NEU4 gene_id:129807|Hs108|chr2 (497 aa)
 initn: 3513 init1: 3448 opt: 3515 Z-score: 2186.7 bits: 414.1 E(32554): 1.8e-115
Smith-Waterman score: 3515; 99.8% identity (99.8% similar) in 497 aa overlap (1-496:1-497)

               10         20        30        40        50
pF1KB5 MMSSAAFPRWL-SMGVPRTPSRTVLFERERTGLTYRVPSLLPVPPGPTLLAFVEQRLSPD
       ::::::::::: ::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MMSSAAFPRWLQSMGVPRTPSRTVLFERERTGLTYRVPSLLPVPPGPTLLAFVEQRLSPD
               10        20        30        40        50        60

      60        70        80        90       100       110
pF1KB5 DSHAHRLVLRRGTLAGGSVRWGALHVLGTAALAEHRSMNPCPVHDAGTGTVFLFFIAVLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DSHAHRLVLRRGTLAGGSVRWGALHVLGTAALAEHRSMNPCPVHDAGTGTVFLFFIAVLG
               70        80        90       100       110       120

     120       130       140       150       160       170
pF1KB5 HTPEAVQIATGRNAARLCCVASRDAGLSWGSARDLTEEAIGGAVQDWATFAVGPGHGVQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 HTPEAVQIATGRNAARLCCVASRDAGLSWGSARDLTEEAIGGAVQDWATFAVGPGHGVQL
              130       140       150       160       170       180

     180       190       200       210       220       230
pF1KB5 PSGRLLVPAYTYRVDRRECFGKICRTSPHSFAFYSDDHGRTWRCGGLVPNLRSGECQLAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PSGRLLVPAYTYRVDRRECFGKICRTSPHSFAFYSDDHGRTWRCGGLVPNLRSGECQLAA
              190       200       210       220       230       240

     240       250       260       270       280       290
pF1KB5 VDGGQAGSFLYCNARSPLGSRVQALSTDEGTSFLPAERVASLPETAWGCQGSIVGFPAPA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 VDGGQAGSFLYCNARSPLGSRVQALSTDEGTSFLPAERVASLPETAWGCQGSIVGFPAPA
              250       260       270       280       290       300

     300       310       320       330       340       350
pF1KB5 PNRPRDDSWSVGPGSPLQPPLLGPGVHEPPEEAAVDPRGGQVPGGPFSRLQPRGDGPRQP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PNRPRDDSWSVGPGSPLQPPLLGPGVHEPPEEAAVDPRGGQVPGGPFSRLQPRGDGPRQP
              310       320       330       340       350       360

     360       370       380       390       400       410
pF1KB5 GPRPGVSGDVGSWTLALPMPFAAPPQSPTWLLYSHPVGRRARLHMGIRLSQSPLDPRSWT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 GPRPGVSGDVGSWTLALPMPFAAPPQSPTWLLYSHPVGRRARLHMGIRLSQSPLDPRSWT
              370       380       390       400       410       420

     420       430       440       450       460       470
pF1KB5 EPWVIYEGPSGYSDLASIGPAPEGGLVFACLYESGARTSYDEISFCTFSLREVLENVPAS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 EPWVIYEGPSGYSDLASIGPAPEGGLVFACLYESGARTSYDEISFCTFSLREVLENVPAS
              430       440       450       460       470       480

     480       490
pF1KB5 PKPPNLGDKPRGCCWPS
       :::::::::::::::::
CCDS54 PKPPNLGDKPRGCCWPS
              490

>>CCDS54442.1 NEU4 gene_id:129807|Hs108|chr2 (484 aa)
 initn: 3443 init1: 3443 opt: 3443 Z-score: 2142.5 bits: 405.9 E(32554): 5.1e-113
Smith-Waterman score: 3443; 100.0% identity (100.0% similar) in 484 aa overlap (13-496:1-484)

               10        20        30        40        50        60
pF1KB5 MMSSAAFPRWLSMGVPRTPSRTVLFERERTGLTYRVPSLLPVPPGPTLLAFVEQRLSPDD
                   ::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54             MGVPRTPSRTVLFERERTGLTYRVPSLLPVPPGPTLLAFVEQRLSPDD
                           10        20        30        40

               70        80        90       100       110       120
pF1KB5 SHAHRLVLRRGTLAGGSVRWGALHVLGTAALAEHRSMNPCPVHDAGTGTVFLFFIAVLGH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 SHAHRLVLRRGTLAGGSVRWGALHVLGTAALAEHRSMNPCPVHDAGTGTVFLFFIAVLGH
       50        60        70        80        90       100

              130       140       150       160       170       180
pF1KB5 TPEAVQIATGRNAARLCCVASRDAGLSWGSARDLTEEAIGGAVQDWATFAVGPGHGVQLP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 TPEAVQIATGRNAARLCCVASRDAGLSWGSARDLTEEAIGGAVQDWATFAVGPGHGVQLP
      110       120       130       140       150       160

              190       200       210       220       230       240
pF1KB5 SGRLLVPAYTYRVDRRECFGKICRTSPHSFAFYSDDHGRTWRCGGLVPNLRSGECQLAAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 SGRLLVPAYTYRVDRRECFGKICRTSPHSFAFYSDDHGRTWRCGGLVPNLRSGECQLAAV
      170       180       190       200       210       220

              250       260       270       280       290       300
pF1KB5 DGGQAGSFLYCNARSPLGSRVQALSTDEGTSFLPAERVASLPETAWGCQGSIVGFPAPAP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DGGQAGSFLYCNARSPLGSRVQALSTDEGTSFLPAERVASLPETAWGCQGSIVGFPAPAP
      230       240       250       260       270       280

              310       320       330       340       350       360
pF1KB5 NRPRDDSWSVGPGSPLQPPLLGPGVHEPPEEAAVDPRGGQVPGGPFSRLQPRGDGPRQPG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 NRPRDDSWSVGPGSPLQPPLLGPGVHEPPEEAAVDPRGGQVPGGPFSRLQPRGDGPRQPG
      290       300       310       320       330       340

              370       380       390       400       410       420
pF1KB5 PRPGVSGDVGSWTLALPMPFAAPPQSPTWLLYSHPVGRRARLHMGIRLSQSPLDPRSWTE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PRPGVSGDVGSWTLALPMPFAAPPQSPTWLLYSHPVGRRARLHMGIRLSQSPLDPRSWTE
      350       360       370       380       390       400

              430       440       450       460       470       480
pF1KB5 PWVIYEGPSGYSDLASIGPAPEGGLVFACLYESGARTSYDEISFCTFSLREVLENVPASP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PWVIYEGPSGYSDLASIGPAPEGGLVFACLYESGARTSYDEISFCTFSLREVLENVPASP
      410       420       430       440       450       460

              490
pF1KB5 KPPNLGDKPRGCCWPS
       ::::::::::::::::
CCDS54 KPPNLGDKPRGCCWPS
      470       480

>>CCDS44682.1 NEU3 gene_id:10825|Hs108|chr11 (461 aa)
 initn: 1160 init1: 657 opt: 957 Z-score: 610.3 bits: 122.3 E(32554): 1.1e-27
Smith-Waterman score: 1154; 42.6% identity (61.2% similar) in 469 aa overlap (24-489:46-455)

10 20 30 40 50
pF1KB5 MMSSAAFPRWLSMGVPRTPSRTVLFERERT-GLTYRVPSLLPVPPGPTLLAFV ::..: :.:::.:.:: .:: :.:::. CCDS44 ASSSAPTETEEPGSSAEVMEEVTTCSFNSPLFRQEDDRGITYRIPALLYIPPTHTFLAFA 20 30 40 50 60 70 60 70 80 90 100 110 pF1KB5 EQRLSPDDSHAHRLVLRRGTLAGGSVRWGALHVLGTAALAEHRSMNPCPVHDAGTGTVFL :.: . : : .:::::: : :.:: :. : :.: ::.:::::: . .: ::: CCDS44 EKRSTRRDEDALHLVLRRGLRIGQLVQWGPLKPLMEATLPGHRTMNPCPVWEQKSGCVFL 80 90 100 110 120 130 120 130 140 150 160 170 pF1KB5 FFIAVLGHTPEAVQIATGRNAARLCCVASRDAGLSWGSARDLTEEAIGGAVQDWATFAVG ::: : ::. : ::..:::::::: . :.::: ::. .::::::.::. .. ::::::: CCDS44 FFICVRGHVTERQQIVSGRNAARLCFIYSQDAGCSWSEVRDLTEEVIGSELKHWATFAVG 140 150 160 170 180 190 180 190 200 210 220 230 pF1KB5 PGHGVQLPSGRLLVPAYTYRVDRRE-CFGKICRTSPHSFAFYSDDHGRTWRCGGLVPNLR ::::.:: ::::..::::: . :: :.: :::. .:::: : ::. : :. . CCDS44 PGHGIQLQSGRLVIPAYTYYIPSWFFCFQLPCKTRPHSLMIYSDDLGVTWHHGRLIRPMV 200 210 220 230 240 250 240 250 260 270 280 290 pF1KB5 SGECQLAAVDGGQAGSFLYCNARSPLGSRVQALSTDEGTSFLPAERVASLPETAWGCQGS . ::..: : : . :::.::.: :..:::::.: .: .: : ::::: CCDS44 TVECEVAEVTGRAGHPVLYCSARTPNRCRAEALSTDHGEGFQRLALSRQLCEPPHGCQGS 260 270 280 290 300 310 300 310 320 330 340 350 pF1KB5 IVGF-PAPAPNRPRDDSWSVGPGSPLQPPLLGPGVHEPPEEAAVDPRGGQVPGGPFSRLQ .:.: : :.: .:.: :. CCDS44 VVSFRPLEIPHRCQDSS---------------------------------------SK-- 320 330 360 370 380 390 400 410 pF1KB5 PRGDGPRQPGPRPGVSGDVGSWTLALPMPFAAPPQSPTWLLYSHPVGRRARLHMGIRLSQ :.: :: : : . : : .:::::::..:. :. .:: :.: CCDS44 ---DAPTIQQSSPGSS---------LRLEEEAGTPSESWLLYSHPTSRKQRVDLGIYLNQ 340 350 360 370 380 420 430 440 450 460 470 pF1KB5 SPLDPRSWTEPWVIYEGPSGYSDLASIGPAPEGGLVFACLYESGARTSYDEISFCTFSLR .::. :..::... :: ::::::.. : :: :.::.: :.. ..:.: :. : CCDS44 TPLEAACWSRPWILHCGPCGYSDLAAL---EEEGL-FGCLFECGTKQECEQIAFRLFTHR 390 400 410 420 430 480 490 pF1KB5 EVLENVPASPKPPNLGDKPRGCCWPS :.: .. .. 
: : .: CCDS44 EILSHLQGDCTSP--GRNPSQFKSN 440 450 460 >>CCDS2501.1 NEU2 gene_id:4759|Hs108|chr2 (380 aa) initn: 966 init1: 363 opt: 809 Z-score: 520.1 bits: 105.3 E(32554): 1.2e-22 Smith-Waterman score: 880; 38.8% identity (56.6% similar) in 454 aa overlap (34-482:20-379) 10 20 30 40 50 60 pF1KB5 SAAFPRWLSMGVPRTPSRTVLFERERTGLTYRVPSLLPVPPGPTLLAFVEQRLSPDDSHA ::.:.:: .: .::::.::: : : :: CCDS25 MASLPVLQKESVFQSGAHAYRIPALLYLPGQQSLLAFAEQRASKKDEHA 10 20 30 40 70 80 90 100 110 120 pF1KB5 HRLVLRRGTLAGGS--VRWGALHVLGTAALAEHRSMNPCPVHDAGTGTVFLFFIAVLGHT . .::::: . . :.: : .:.. : : ::::::::..:: :::.::::::. :.. CCDS25 ELIVLRRGDYDAPTHQVQWQAQEVVAQARLDGHRSMNPCPLYDAQTGTLFLFFIAIPGQV 50 60 70 80 90 100 130 140 150 160 170 180 pF1KB5 PEAVQIATGRNAARLCCVASRDAGLSWGSARDLTEEAIGGAVQDWATFAVGPGHGVQLPS : :. : :..::: :.: : : .:.: ::::. ::: : ..:.:::::::: .:: . CCDS25 TEQQQLQTRANVTRLCQVTSTDHGRTWSSPRDLTDAAIGPAYREWSTFAVGPGHCLQLHD 110 120 130 140 150 160 190 200 210 220 230 pF1KB5 -GR-LLVPAYTYRVDRRECFGKICRTSPHSFAFYSDDHGRTWRCGGLVPNLRSGECQLAA .: :.::::.:: . : : : .: : : :::::: : .: . . :::.: CCDS25 RARSLVVPAYAYRK-----LHPIQRPIPSAFCFLSHDHGRTWARGHFVAQ-DTLECQVAE 170 180 190 200 210 220 240 250 260 270 280 290 pF1KB5 VDGGQAGSFLYCNARSPLGSRVQALSTDEGTSFLPAERVASLPETA-WGCQGSIVGFPAP :. :. . :::: : .:::: ::..: .: .. : .: : :::::...::.: CCDS25 VETGEQ-RVVTLNARSHLRARVQAQSTNDGLDFQESQLVKKLVEPPPQGCQGSVISFPSP 230 240 250 260 270 280 300 310 320 330 340 350 pF1KB5 APNRPRDDSWSVGPGSPLQPPLLGPGVHEPPEEAAVDPRGGQVPGGPFSRLQPRGDGPRQ :. ::::: : CCDS25 -----RS-----GPGSPAQ----------------------------------------- 290 360 370 380 390 400 410 pF1KB5 PGPRPGVSGDVGSWTLALPMPFAAPPQSPTWLLYSHPVGRRARLHMGIRLSQSPLDPRSW ::::.::. : .: :. : :..: CCDS25 ------------------------------WLLYTHPTHSWQRADLGAYLNPRPPAPEAW 300 310 320 420 430 440 450 460 470 pF1KB5 TEPWVIYEGPSGYSDLASIGPAPEGGLVFACLYESGARTSYDEISFCTFSLREVLENVPA .:: .. .: .:::: :.: .:.:. .:.::::.. .:.:: : :.:.... 
:: CCDS25 SEPVLLAKGSCAYSDLQSMGTGPDGSPLFGCLYEAN---DYEEIVFLMFTLKQAF---PA 330 340 350 360 370 480 490 pF1KB5 SPKPPNLGDKPRGCCWPS : CCDS25 EYLPQ 380

496 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 21:06:06 2016 done: Sat Nov 5 21:06:06 2016
 Total Scan time: 3.440 Total Display time: 0.020

Function used was FASTA [36.3.4 Apr, 2011]