FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8129, 341 aa
  1>>>pF1KB8129 341 - 341 aa - 341 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.3126+/-0.000819; mu= 12.0391+/- 0.050
 mean_var=95.6929+/-18.712, 0's: 0 Z-trim(109.6): 15 B-trim: 10 in 1/52
 Lambda= 0.131110
 statistics sampled from 11019 (11030) to 11019 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.705), E-opt: 0.2 (0.339), width: 16
 Scan time: 2.010

The best scores are:                                   opt  bits E(32554)
CCDS7765.1 ARFIP2 gene_id:23647|Hs108|chr11   ( 341) 2216 429.1 2.5e-120
CCDS55740.1 ARFIP2 gene_id:23647|Hs108|chr11  ( 303) 1963 381.3 5.7e-106
CCDS73250.1 ARFIP2 gene_id:23647|Hs108|chr11  ( 374) 1794 349.3 2.9e-96
CCDS55739.1 ARFIP2 gene_id:23647|Hs108|chr11  ( 256) 1506 294.8 5.2e-80
CCDS3780.1 ARFIP1 gene_id:27236|Hs108|chr4    ( 341) 1358 266.8 1.8e-71
CCDS34080.1 ARFIP1 gene_id:27236|Hs108|chr4   ( 373) 1261 248.5 6.4e-66

>>CCDS7765.1 ARFIP2 gene_id:23647|Hs108|chr11 (341 aa)
 initn: 2216 init1: 2216 opt: 2216 Z-score: 2273.8 bits: 429.1 E(32554): 2.5e-120
Smith-Waterman score: 2216; 100.0% identity (100.0% similar) in 341 aa overlap (1-341:1-341)

               10        20        30        40        50        60
pF1KB8 MTDGILGKAATMEIPIHGNGEARQLPEDDGLEQDLQQVMVSGPNLNETSIVSGGYGGSGD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 MTDGILGKAATMEIPIHGNGEARQLPEDDGLEQDLQQVMVSGPNLNETSIVSGGYGGSGD
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 GLIPTGSGRHPSHSTTPSGPGDEVARGIAGEKFDIVKKWGINTYKCTKQLLSERFGRGSR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 GLIPTGSGRHPSHSTTPSGPGDEVARGIAGEKFDIVKKWGINTYKCTKQLLSERFGRGSR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 TVDLELELQIELLRETKRKYESVLQLGRALTAHLYSLLQTQHALGDAFADLSQKSPELQE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77
       TVDLELELQIELLRETKRKYESVLQLGRALTAHLYSLLQTQHALGDAFADLSQKSPELQE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 EFGYNAETQKLLCKNGETLLGAVNFFVSSINTLVTKTMEDTLMTVKQYEAARLEYDAYRT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 EFGYNAETQKLLCKNGETLLGAVNFFVSSINTLVTKTMEDTLMTVKQYEAARLEYDAYRT
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB8 DLEELSLGPRDAGTRGRLESAQATFQAHRDKYEKLRGDVAIKLKFLEENKIKVMHKQLLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 DLEELSLGPRDAGTRGRLESAQATFQAHRDKYEKLRGDVAIKLKFLEENKIKVMHKQLLL
              250       260       270       280       290       300

              310       320       330       340
pF1KB8 FHNAVSAYFAGNQKQLEQTLQQFNIKLRPPGAEKPSWLEEQ
       :::::::::::::::::::::::::::::::::::::::::
CCDS77 FHNAVSAYFAGNQKQLEQTLQQFNIKLRPPGAEKPSWLEEQ
              310       320       330       340

>>CCDS55740.1 ARFIP2 gene_id:23647|Hs108|chr11 (303 aa)
 initn: 1963 init1: 1963 opt: 1963 Z-score: 2016.0 bits: 381.3 E(32554): 5.7e-106
Smith-Waterman score: 1963; 100.0% identity (100.0% similar) in 303 aa overlap (39-341:1-303)

       10        20        30        40        50        60
pF1KB8 AATMEIPIHGNGEARQLPEDDGLEQDLQQVMVSGPNLNETSIVSGGYGGSGDGLIPTGSG
                                     ::::::::::::::::::::::::::::::
CCDS55                               MVSGPNLNETSIVSGGYGGSGDGLIPTGSG
                                             10        20        30

       70        80        90       100       110       120
pF1KB8 RHPSHSTTPSGPGDEVARGIAGEKFDIVKKWGINTYKCTKQLLSERFGRGSRTVDLELEL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 RHPSHSTTPSGPGDEVARGIAGEKFDIVKKWGINTYKCTKQLLSERFGRGSRTVDLELEL
               40        50        60        70        80        90

      130       140       150       160       170       180
pF1KB8 QIELLRETKRKYESVLQLGRALTAHLYSLLQTQHALGDAFADLSQKSPELQEEFGYNAET
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 QIELLRETKRKYESVLQLGRALTAHLYSLLQTQHALGDAFADLSQKSPELQEEFGYNAET
              100       110       120       130       140       150

      190       200       210       220       230       240
pF1KB8 QKLLCKNGETLLGAVNFFVSSINTLVTKTMEDTLMTVKQYEAARLEYDAYRTDLEELSLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 QKLLCKNGETLLGAVNFFVSSINTLVTKTMEDTLMTVKQYEAARLEYDAYRTDLEELSLG
              160       170       180       190       200       210

      250       260       270       280       290       300
pF1KB8 PRDAGTRGRLESAQATFQAHRDKYEKLRGDVAIKLKFLEENKIKVMHKQLLLFHNAVSAY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 PRDAGTRGRLESAQATFQAHRDKYEKLRGDVAIKLKFLEENKIKVMHKQLLLFHNAVSAY
              220       230       240       250       260       270

      310       320       330       340
pF1KB8 FAGNQKQLEQTLQQFNIKLRPPGAEKPSWLEEQ
       :::::::::::::::::::::::::::::::::
CCDS55 FAGNQKQLEQTLQQFNIKLRPPGAEKPSWLEEQ
              280       290       300

>>CCDS73250.1 ARFIP2 gene_id:23647|Hs108|chr11 (374 aa)
 initn: 1794 init1: 1794 opt: 1794 Z-score: 1841.8 bits: 349.3 E(32554): 2.9e-96
Smith-Waterman score: 2133; 91.2% identity (91.2% similar) in 373 aa overlap (2-341:2-374)

               10        20        30        40        50        60
pF1KB8 MTDGILGKAATMEIPIHGNGEARQLPEDDGLEQDLQQVMVSGPNLNETSIVSGGYGGSGD
        :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 MTDGILGKAATMEIPIHGNGEARQLPEDDGLEQDLQQVMVSGPNLNETSIVSGGYGGSGD
               10        20        30        40        50        60

                                                70        80
pF1KB8 GLIPTG---------------------------------SGRHPSHSTTPSGPGDEVARG
       ::::::                                 :::::::::::::::::::::
CCDS73 GLIPTGKSISCARREVWVGPQRCLVYLREAGVRELDPLGSGRHPSHSTTPSGPGDEVARG
               70        80        90       100       110       120

        90       100       110       120       130       140
pF1KB8 IAGEKFDIVKKWGINTYKCTKQLLSERFGRGSRTVDLELELQIELLRETKRKYESVLQLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 IAGEKFDIVKKWGINTYKCTKQLLSERFGRGSRTVDLELELQIELLRETKRKYESVLQLG
              130       140       150       160       170       180

       150       160       170       180       190       200
pF1KB8 RALTAHLYSLLQTQHALGDAFADLSQKSPELQEEFGYNAETQKLLCKNGETLLGAVNFFV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 RALTAHLYSLLQTQHALGDAFADLSQKSPELQEEFGYNAETQKLLCKNGETLLGAVNFFV
              190       200       210       220       230       240

       210       220       230       240       250       260
pF1KB8 SSINTLVTKTMEDTLMTVKQYEAARLEYDAYRTDLEELSLGPRDAGTRGRLESAQATFQA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 SSINTLVTKTMEDTLMTVKQYEAARLEYDAYRTDLEELSLGPRDAGTRGRLESAQATFQA
              250       260       270       280       290       300

       270       280       290       300       310       320
pF1KB8 HRDKYEKLRGDVAIKLKFLEENKIKVMHKQLLLFHNAVSAYFAGNQKQLEQTLQQFNIKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 HRDKYEKLRGDVAIKLKFLEENKIKVMHKQLLLFHNAVSAYFAGNQKQLEQTLQQFNIKL
              310       320       330       340       350       360

       330       340
pF1KB8 RPPGAEKPSWLEEQ
       ::::::::::::::
CCDS73 RPPGAEKPSWLEEQ
              370
>>CCDS55739.1 ARFIP2 gene_id:23647|Hs108|chr11 (256 aa)
 initn: 1506 init1: 1506 opt: 1506 Z-score: 1549.9 bits: 294.8 E(32554): 5.2e-80
Smith-Waterman score: 1506; 99.6% identity (100.0% similar) in 237 aa overlap (105-341:20-256)

           80        90       100       110       120       130
pF1KB8 TTPSGPGDEVARGIAGEKFDIVKKWGINTYKCTKQLLSERFGRGSRTVDLELELQIELLR
                                     .:::::::::::::::::::::::::::::
CCDS55            MKPALCLVAMGALVMDSSPQCTKQLLSERFGRGSRTVDLELELQIELLR
                          10        20        30        40

          140       150       160       170       180       190
pF1KB8 ETKRKYESVLQLGRALTAHLYSLLQTQHALGDAFADLSQKSPELQEEFGYNAETQKLLCK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 ETKRKYESVLQLGRALTAHLYSLLQTQHALGDAFADLSQKSPELQEEFGYNAETQKLLCK
      50        60        70        80        90       100

          200       210       220       230       240       250
pF1KB8 NGETLLGAVNFFVSSINTLVTKTMEDTLMTVKQYEAARLEYDAYRTDLEELSLGPRDAGT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 NGETLLGAVNFFVSSINTLVTKTMEDTLMTVKQYEAARLEYDAYRTDLEELSLGPRDAGT
     110       120       130       140       150       160

          260       270       280       290       300       310
pF1KB8 RGRLESAQATFQAHRDKYEKLRGDVAIKLKFLEENKIKVMHKQLLLFHNAVSAYFAGNQK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 RGRLESAQATFQAHRDKYEKLRGDVAIKLKFLEENKIKVMHKQLLLFHNAVSAYFAGNQK
     170       180       190       200       210       220

          320       330       340
pF1KB8 QLEQTLQQFNIKLRPPGAEKPSWLEEQ
       :::::::::::::::::::::::::::
CCDS55 QLEQTLQQFNIKLRPPGAEKPSWLEEQ
     230       240       250

>>CCDS3780.1 ARFIP1 gene_id:27236|Hs108|chr4 (341 aa)
 initn: 1355 init1: 1227 opt: 1358 Z-score: 1396.7 bits: 266.8 E(32554): 1.8e-71
Smith-Waterman score: 1358; 59.9% identity (89.3% similar) in 337 aa overlap (8-341:7-341)

10 20 30 40 50 60
pF1KB8 MTDGILGKAATMEIPIHGNGEARQLPEDDGLEQDLQQVMVSGPNLNETSIVSGGYGGSGD
: .. :::. .:::. . .. ....::.. . :: .:.::.:.: :. .. .
CCDS37 MAQESPKNSAAEIPVTSNGEVDD-SREHSFNRDLKHSLPSGLGLSETQITSHGFDNTKE
10 20 30 40 50

70 80 90 100 110
pF1KB8 GLIPTGSGRHPSHSTTPSGP---GDEVARGIAGEKFDIVKKWGINTYKCTKQLLSERFGR
:.: .:. . ... : ::: .::. ..
: ::...:.::..::::::.:..::..::
CCDS37 GVIEAGAFQGGQRTQTKSGPVILADEI-KNPAMEKLELVRKWSLNTYKCTRQIISEKLGR
60 70 80 90 100 110

120 130 140 150 160 170
pF1KB8 GSRTVDLELELQIELLRETKRKYESVLQLGRALTAHLYSLLQTQHALGDAFADLSQKSPE
:::::::::: ::..::..:.:::..:.:...:...:.....::. ::::::::: :: :
CCDS37 GSRTVDLELEAQIDILRDNKKKYENILKLAQTLSTQLFQMVHTQRQLGDAFADLSLKSLE
120 130 140 150 160 170

180 190 200 210 220 230
pF1KB8 LQEEFGYNAETQKLLCKNGETLLGAVNFFVSSINTLVTKTMEDTLMTVKQYEAARLEYDA
:.:::::::.::::: :::::::::.:::..:.::::.::.:::::::::::.::.::::
CCDS37 LHEEFGYNADTQKLLAKNGETLLGAINFFIASVNTLVNKTIEDTLMTVKQYESARIEYDA
180 190 200 210 220 230

240 250 260 270 280 290
pF1KB8 YRTDLEELSLGPRDAGTRGRLESAQATFQAHRDKYEKLRGDVAIKLKFLEENKIKVMHKQ
::::::::.::::::.: ..:..: ::::..::.:.:.::..:::::::::.::.:.:
CCDS37 YRTDLEELNLGPRDANTLPKIEQSQHLFQAHKEKYDKMRNDVSVKLKFLEENKVKVLHNQ
240 250 260 270 280 290

300 310 320 330 340
pF1KB8 LLLFHNAVSAYFAGNQKQLEQTLQQFNIKLRPPGAEKPSWLEEQ
:.:::::..::::::::::::::.::.:::. ::.. :::::::
CCDS37 LVLFHNAIAAYFAGNQKQLEQTLKQFHIKLKTPGVDAPSWLEEQ
300 310 320 330 340

>>CCDS34080.1 ARFIP1 gene_id:27236|Hs108|chr4 (373 aa)
 initn: 1337 init1: 1227 opt: 1261 Z-score: 1297.0 bits: 248.5 E(32554): 6.4e-66
Smith-Waterman score: 1293; 55.6% identity (81.6% similar) in 369 aa overlap (8-341:7-373)

10 20 30 40 50 60
pF1KB8 MTDGILGKAATMEIPIHGNGEARQLPEDDGLEQDLQQVMVSGPNLNETSIVSGGYGGSGD
: .. :::. .:::. . .. ....::.. . :: .:.::.:.: :. .. .
CCDS34 MAQESPKNSAAEIPVTSNGEVDD-SREHSFNRDLKHSLPSGLGLSETQITSHGFDNTKE
10 20 30 40 50

70 80
pF1KB8 GLIPTG-----------SGRHPSH---------------------STTPSGP---GDEVA
:.: .: : ::. . : ::: .::.
CCDS34 GVIEAGAFQGSPAPPLPSVMSPSRVAASRLAQQGSDLIVPAGGQRTQTKSGPVILADEI-
60 70 80 90 100 110

90 100 110 120 130 140
pF1KB8 RGIAGEKFDIVKKWGINTYKCTKQLLSERFGRGSRTVDLELELQIELLRETKRKYESVLQ
.. : ::...:.::..::::::.:..::..:::::::::::: ::..::..:.:::..:.
CCDS34 KNPAMEKLELVRKWSLNTYKCTRQIISEKLGRGSRTVDLELEAQIDILRDNKKKYENILK
120 130 140 150 160 170

150 160 170 180 190 200
pF1KB8 LGRALTAHLYSLLQTQHALGDAFADLSQKSPELQEEFGYNAETQKLLCKNGETLLGAVNF
:...:...:.....::. ::::::::: :: ::.:::::::.::::: :::::::::.::
CCDS34 LAQTLSTQLFQMVHTQRQLGDAFADLSLKSLELHEEFGYNADTQKLLAKNGETLLGAINF
180 190 200 210 220 230

210 220 230 240 250 260
pF1KB8 FVSSINTLVTKTMEDTLMTVKQYEAARLEYDAYRTDLEELSLGPRDAGTRGRLESAQATF
:..:.::::.::.:::::::::::.::.::::::::::::.::::::.: ..:..: :
CCDS34 FIASVNTLVNKTIEDTLMTVKQYESARIEYDAYRTDLEELNLGPRDANTLPKIEQSQHLF
240 250 260 270 280 290

270 280 290 300 310 320
pF1KB8 QAHRDKYEKLRGDVAIKLKFLEENKIKVMHKQLLLFHNAVSAYFAGNQKQLEQTLQQFNI
:::..::.:.:.::..:::::::::.::.:.::.:::::..::::::::::::::.::.:
CCDS34 QAHKEKYDKMRNDVSVKLKFLEENKVKVLHNQLVLFHNAIAAYFAGNQKQLEQTLKQFHI
300 310 320 330 340 350

330 340
pF1KB8 KLRPPGAEKPSWLEEQ
::. ::.. :::::::
CCDS34 KLKTPGVDAPSWLEEQ
360 370

341 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 09:48:38 2016 done: Fri Nov 4 09:48:38 2016
 Total Scan time: 2.010 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]