GenomeNet

Database: UniProt
Entry: H2ZIA6_CIOSA
ID   H2ZIA6_CIOSA            Unreviewed;      3071 AA.
AC   H2ZIA6;
DT   18-APR-2012, integrated into UniProtKB/TrEMBL.
DT   18-APR-2012, sequence version 1.
DT   05-JUN-2019, entry version 52.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCSAVP00000017322};
OS   Ciona savignyi (Pacific transparent sea squirt).
OC   Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona;
OC   Phlebobranchia; Cionidae; Ciona.
OX   NCBI_TaxID=51511 {ECO:0000313|Ensembl:ENSCSAVP00000017322, ECO:0000313|Proteomes:UP000007875};
RN   [1] {ECO:0000313|Ensembl:ENSCSAVP00000017322}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E.,
RA   Ait-zahra M., Allen N., Allen T., An P., Anderson M., Anderson S.,
RA   Arachchi H., Armbruster J., Bachantsang P., Baldwin J., Barry A.,
RA   Bayul T., Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L.,
RA   Borowsky M., Boukhgalter B., Brunache A., Butler J., Calixte N.,
RA   Calvo S., Camarata J., Campo K., Chang J., Cheshatsang Y., Citroen M.,
RA   Collymore A., Considine T., Cook A., Cooke P., Corum B., Cuomo C.,
RA   David R., Dawoe T., Degray S., Dodge S., Dooley K., Dorje P.,
RA   Dorjee K., Dorris L., Duffey N., Dupes A., Elkins T., Engels R.,
RA   Erickson J., Farina A., Faro S., Ferreira P., Fischer H.,
RA   Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G., Gnerre S.,
RA   Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K., Hafez N.,
RA   Hagopian D., Hagos B., Hall J., Hatcher B., Heller A., Higgins H.,
RA   Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E., Iliev I.,
RA   Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M., Karlsson E.,
RA   Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E., Labutti K.,
RA   Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T.,
RA   Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O.,
RA   Lui A., Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J.,
RA   Manning J., Marabella R., Maru K., Matthews C., Mauceli E.,
RA   Mccarthy M., Mcdonough S., Mcghee T., Meldrim J., Meneus L.,
RA   Mesirov J., Mihalev A., Mihova T., Mikkelsen T., Mlenga V., Moru K.,
RA   Mozes J., Mulrain L., Munson G., Naylor J., Newes C., Nguyen C.,
RA   Nguyen N., Nguyen T., Nicol R., Nielsen C., Nizzari M., Norbu C.,
RA   Norbu N., O'donnell P., Okoawo O., O'leary S., Omotosho B.,
RA   O'neill K., Osman S., Parker S., Perrin D., Phunkhang P., Piqani B.,
RA   Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V., Raymond C.,
RA   Retta R., Richardson S., Rise C., Rodriguez J., Rogers J., Rogov P.,
RA   Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T.,
RA   Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C.,
RA   Spencer B., Stalker J., Stange-thomann N., Stavropoulos S.,
RA   Stetson K., Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P.,
RA   Tenzing P., Tesfaye S., Theodore J., Thoulutsang Y., Topham K.,
RA   Towey S., Tsamla T., Tsomo N., Vallee D., Vassiliev H.,
RA   Venkataraman V., Vinson J., Vo A., Wade C., Wang S., Wangchuk T.,
RA   Wangdi T., Whittaker C., Wilkinson J., Wu Y., Wyman D., Yadav S.,
RA   Yang S., Yang X., Yeager S., Yee E., Young G., Zainoun J., Zembeck L.,
RA   Zimmer A., Zody M., Lander E.;
RL   Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSCSAVP00000017322}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (FEB-2012) to UniProtKB.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation
CC       of feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00122}.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   -----------------------------------------------------------------------
DR   STRING; 51511.ENSCSAVP00000017322; -.
DR   Ensembl; ENSCSAVT00000017511; ENSCSAVP00000017322; ENSCSAVG00000010201.
DR   eggNOG; ENOG410IP6E; Eukaryota.
DR   eggNOG; ENOG410XP6Z; LUCA.
DR   GeneTree; ENSGT00940000161177; -.
DR   InParanoid; H2ZIA6; -.
DR   OMA; FGPCNCK; -.
DR   TreeFam; TF335359; -.
DR   Proteomes; UP000007875; Unassembled WGS sequence.
DR   GO; GO:0007155; P:cell adhesion; IEA:InterPro.
DR   Gene3D; 2.60.120.1490; -; 1.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR013032; EGF-like_CS.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR008979; Galactose-bd-like_sf.
DR   InterPro; IPR010307; Laminin_dom_II.
DR   InterPro; IPR002049; Laminin_EGF.
DR   InterPro; IPR001791; Laminin_G.
DR   InterPro; IPR000034; Laminin_IV.
DR   InterPro; IPR008211; Laminin_N.
DR   InterPro; IPR038684; Laminin_N_sf.
DR   Pfam; PF00052; Laminin_B; 2.
DR   Pfam; PF00053; Laminin_EGF; 16.
DR   Pfam; PF00054; Laminin_G_1; 5.
DR   Pfam; PF06009; Laminin_II; 1.
DR   Pfam; PF00055; Laminin_N; 1.
DR   SMART; SM00181; EGF; 8.
DR   SMART; SM00180; EGF_Lam; 16.
DR   SMART; SM00281; LamB; 2.
DR   SMART; SM00282; LamG; 5.
DR   SMART; SM00136; LamNT; 1.
DR   SUPFAM; SSF49785; SSF49785; 1.
DR   SUPFAM; SSF49899; SSF49899; 5.
DR   PROSITE; PS00022; EGF_1; 2.
DR   PROSITE; PS01248; EGF_LAM_1; 5.
DR   PROSITE; PS50027; EGF_LAM_2; 15.
DR   PROSITE; PS50025; LAM_G_DOMAIN; 5.
DR   PROSITE; PS51115; LAMININ_IVA; 2.
DR   PROSITE; PS51117; LAMININ_NTER; 1.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   Complete proteome {ECO:0000313|Proteomes:UP000007875};
KW   Disulfide bond {ECO:0000256|PROSITE-ProRule:PRU00460,
KW   ECO:0000256|SAAS:SAAS00814887};
KW   Laminin EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00460,
KW   ECO:0000256|SAAS:SAAS00580772};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007875};
KW   Repeat {ECO:0000256|SAAS:SAAS00814929}.
FT   DOMAIN        1    250       Laminin N-terminal. {ECO:0000259|PROSITE:
FT                                PS51117}.
FT   DOMAIN      251    305       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN      306    375       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN      376    438       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN      439    487       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN      508    696       Laminin IV type A. {ECO:0000259|PROSITE:
FT                                PS51115}.
FT   DOMAIN      725    774       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN      775    832       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN      833    880       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN      881    929       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN      930    976       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN      977   1022       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN     1023   1068       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN     1069   1126       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN     1158   1343       Laminin IV type A. {ECO:0000259|PROSITE:
FT                                PS51115}.
FT   DOMAIN     1377   1421       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN     1422   1482       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN     1483   1529       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN     2076   2258       LAM_G_DOMAIN. {ECO:0000259|PROSITE:
FT                                PS50025}.
FT   DOMAIN     2272   2452       LAM_G_DOMAIN. {ECO:0000259|PROSITE:
FT                                PS50025}.
FT   DOMAIN     2457   2641       LAM_G_DOMAIN. {ECO:0000259|PROSITE:
FT                                PS50025}.
FT   DOMAIN     2703   2877       LAM_G_DOMAIN. {ECO:0000259|PROSITE:
FT                                PS50025}.
FT   DOMAIN     2882   3068       LAM_G_DOMAIN. {ECO:0000259|PROSITE:
FT                                PS50025}.
FT   REGION      517    539       Disordered. {ECO:0000256|MobiDB-lite:
FT                                H2ZIA6}.
FT   REGION     2653   2683       Disordered. {ECO:0000256|MobiDB-lite:
FT                                H2ZIA6}.
FT   COILED     1723   1743       {ECO:0000256|SAM:Coils}.
FT   COILED     2051   2071       {ECO:0000256|SAM:Coils}.
FT   COMPBIAS   2653   2669       Polar. {ECO:0000256|MobiDB-lite:H2ZIA6}.
FT   DISULFID    273    282       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID    343    352       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID    414    423       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID    458    467       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID    744    753       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID    803    812       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID    852    861       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID    864    878       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID    881    893       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID    883    900       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID    902    911       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID    930    942       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID    950    959       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID    995   1004       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID   1023   1035       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID   1025   1042       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID   1044   1053       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID   1069   1081       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID   1097   1106       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID   1396   1405       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID   1453   1462       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID   1483   1495       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID   1485   1502       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID   1504   1513       {ECO:0000256|PROSITE-ProRule:PRU00460}.
SQ   SEQUENCE   3071 AA;  339704 MW;  72355DDC4AC28931 CRC64;
     GLFPSVVNLA SHAEITSNAT CGERGPDKFC KLVQHVDRRT DRGQQCGICD QFSRNPSERH
     AVENAIDGRG DRWWQSPTID QGYQYHRVTL TLDLKQVFQV AYVIIKAANS PRPGNWILEK
     STDGVTWSPW QYFATSDREC LLKYGKVASQ LGAPNYKTDS EVMCTTYFSR LNPFQNGEIH
     VSLINGRPSA ENPSFELVDF TSARYIRLRL QKIRTLHGDL MTLSIGSTQN DPSVTRRYYY
     SIKDISIGGM CICYGHASTC PTNTQNGLFH CACEHNTVGF NCERCAPGFH QYKWMPGNNG
     FQCEECNCHG HATECYYDPA VNERGDSLNT RGEYIGGGVC VGCRDGTAGI NCQSCVDGMY
     RPLDVEADSS HPCLPCNCNA YGGTRISDSG PYNCVKDIDH IDAENDLFPG SCYCKVGYTG
     QQCDKCAFGY TGFPYCEPCP CNTSGSLNID PCIGDCACKE HVTGDNCNQC KVGFYNLQRN
     NPMGCDECFC NGATQSCVSS NLRWSTIHTN EGWEVRDQRG RRRTRPQIDP DLHEPYVEHG
     PTARELGTTM YYWSAPARYL GNKLNSYGGT LRYTVSYDIL QGAQPMPIQA NDIILDGNGR
     TVIYNHRGML ISPEIETEVA LKLASSVFGS RDGWVNAESL QPVSKADMIA VLSNLRRILI
     RAVYSRNIGA VYRIHDVYMD SATPEGKGLP ASSVEICHCP PGYTGYSCQD CATGYWRNGE
     QCFPCNCNHH SNICNKLNGA CENCQHSTIG IHCERCKPGY YGNALNGTSE DCQRCSCPLA
     TTSNNFSPTC RLDKTGDMMC LECAVGNSGK RCETCSDGYF GNPFAPGGSC QPCSCNGNSG
     ACNQLNGVCK FCKGHTAGDH CERCEAAYYG DAILRKNCGP CECNPIGSLS RQCDFVTGQC
     PCRQDVYGRK CDACASNNYL SIEEKMCVPC DCNPIGSIMM QCRSDGSCVC RTGVVGKTCG
     ICSTGYHSFG DGGCIECDCT HTNGHCNPAT GECVCPANTH GEGCMNCVDG SYQWSLNNGC
     TLCNCSKAGS TSFTCDNITG QCPCKFEYTD KTCNRCRPGY YGYPNCKACN CLVDGTQAST
     CLNDGTCECS DDGQCLCKAN VQGLHCDRCV AGAFGLSNEN EVGCTTCFCG VTNQCQQATY
     YWGNPVRFLL QFAKARQSFE TVFEVTNEIA SRSTVGVADD IALLDSNTAV RAMRSGPYYW
     RLPPQFNGDR RLSYGGKLRY TIYFVSEFST PPGMRPEASV IIQGGNLQTI KYSMEDPHIK
     QTRAYVVPMQ EFGWAHLDSN KNVTRDEFSL ILQNVNSILI KASYGFLMDQ SRLSEVYLDV
     AVAGVTGRGD SATDLPPAYG IEECICPPGY SGLSCQNCMA GYMRVQREPN LGMCELCNCF
     NRSDSCDIFT GKCLNCIDKF VGHNCETCET GYYFSGNSCH KCECPGAGAI NNFSPTCHSD
     GSDGIASYTC DACQIGYEGK HCERCADGYF GDPTVPGGFC AKCGCNVAGS LDGACDKVTG
     QCNCDEGIKG RICDRCSDKH VITTEGCREC NDECAGKLLL EIEEMVNRTN DVDIRDMFPP
     PWNRLIGARN VTAQLQNVFA PSFLYTLTDS STVKLTTTLL SLTQPYIDLL LHRKLNYFVL
     QRRRAKVHGK TTFLLRPKSI FITAVSNMAT RLLHGLQNGT AQDAADIYSK VDIKMNEMRA
     RNINQYKEMA DTELEFAKDM LELVYTNFTA RANQTEPQAH LFNEELIKQV GEIETKFRHL
     QMELSNASTD IRVATTKNVL NSQTLRQTMG NIERINNMAT MTNVTLDEAQ KIVNAAIDLY
     NSTQAEVEAV TLHNNNILGS DLQGLQMSSV KSAVEHSAQL QEDARMMLEV YKNVSAFSYN
     QTEAASRWTN IIRLLVRANK TSKTAIKQGK AALSTATTDL GIRANESLLR NMAILNEVAE
     LRNTVNGDRD LLRSLEDHLR SSNHSALNLT DQHNRLLSRI DKINAAPSLT SNELFTITRS
     SNKVLGEINK QLEGAQENVR LLGETELQVL QRSQIAKEVA DNLDAVSEGM RSVSEAKERT
     TSKLERVQLA SESVRANISN IRQLIEDARR KAGSIRVSVS SKGKCAMAYK PTITPGKTTE
     IVLTIQNSEP NNALLYLARP RQDYIVVELR DYKVTVTWDL GGGPGFVQNP LIIKADTENL
     RSSPDWRNQE LNVTAYRESD KIATMTNVGG VSPGSYSMLD INNDDILFVG GILNTTEVHE
     AVMYRTFNGC MGEAFLQGRT VGLWNFREIN NPEDCKGCIE IEVATDKEQT NLYSYYGRSY
     SVLRFNNHDS KRTQISIRLQ SFSSEGLIVY TGSATHPDFY SIELNKGRIM LKLDLGNGVH
     NASTENQYNT GEWVQVTFRR FAGNSLLSIK TVNQSQENIW LRIQGSDESL NFTSGDVTCV
     GGFPSDRTIK ADVTKISFVG CLKEFSQNNL LRSLKSTVVE DVARDSGVQT ACPAEITRDI
     GISNKGFVEL KPINLQPTSS ISITFNTRNE DAVLLFATSD NRQLRRRRRQ TDEAYYSMYV
     YDGFVHTALK LPGSPHIRLT SKFNIYNDGR DHTATLHRRR ENITLQMNED GDYVSEYLPR
     GADQTITVRR VFVGGIPAEF NSTKAVGTEK SLNGCVRDLM SNLLVNFKEA VQAKNSYFGT
     CAFVEFEPQP TLTSATPPNT FTSNSMNPPK PPRATTPNTQ PPLSNDPAMV PKQPNMVMIG
     NFSAHFGLSR NSHVTFKLKK NIAKNFMSFR FTFDLKVRTL GRNGILVYAG HKTQVDFFAL
     KVKEGIPIFQ FDNGRGRGEA DGSIRINDGK WHRIRLRRSR RKGFISVDGN KKTTSPKGAN
     IMGVDNILYV GGLPESFNSK KIGQLSSMDA CIKDLVLNRK RKLNLSKPRS AHKVDQCYKK
     MESGAFFNGK GYAIYSEKYR VGPRMLIEME FRTWNPSGIL LSINNRRSDG LGIELVNGSI
     YFRTDNGAGP FEAVYSDAST QHNTGRKVKF NLCDGRWHSL IANKDMNSLS LVVDGNAVPT
     VVSRATTNSA DTKDPLYVGG IPDGTTNQQQ VRTNQNLQGC IRNVRLKTTQ VVRFDEIHES
     VGVYPNSCPK R
//