GenomeNet

Database: UniProt/TrEMBL
Entry: E5SR55_TRISP
LinkDB: E5SR55_TRISP
Original site: E5SR55_TRISP 
ID   E5SR55_TRISP            Unreviewed;      1465 AA.
AC   E5SR55;
DT   08-MAR-2011, integrated into UniProtKB/TrEMBL.
DT   08-MAR-2011, sequence version 1.
DT   01-MAY-2013, entry version 12.
DE   SubName: Full=Cuticle collagen rol-6;
GN   ORFNames=Tsp_09548;
OS   Trichinella spiralis (Trichina worm).
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Trichocephalida;
OC   Trichinellidae; Trichinella.
OX   NCBI_TaxID=6334;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ISS 195;
RX   PubMed=21336279; DOI=10.1038/ng.769;
RA   Mitreva M., Jasmer D.P., Zarlenga D.S., Wang Z., Abubucker S.,
RA   Martin J., Taylor C.M., Yin Y., Fulton L., Minx P., Yang S.P.,
RA   Warren W.C., Fulton R.S., Bhonagiri V., Zhang X., Hallsworth-Pepin K.,
RA   Clifton S.W., McCarter J.P., Appleton J., Mardis E.R., Wilson R.K.;
RT   "The draft genome of the parasitic nematode Trichinella spiralis.";
RL   Nat. Genet. 43:228-235(2011).
CC   -!- SIMILARITY: Contains 1 ARID domain.
CC   -!- CAUTION: The sequence shown here is derived from an
CC       EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is
CC       preliminary data.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; ABIR02001550; EFV52694.1; -; Genomic_DNA.
DR   RefSeq; XP_003371361.1; XM_003371313.1.
DR   EnsemblMetazoa; EFV52694; EFV52694; EFV52694.
DR   GeneID; 10903024; -.
DR   KEGG; tsp:Tsp_09548; -.
DR   CTD; 10903024; -.
DR   KO; K11653; -.
DR   GO; GO:0005581; C:collagen; IEA:UniProtKB-KW.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR   GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR   Gene3D; 1.10.150.60; -; 1.
DR   InterPro; IPR001606; ARID/BRIGHT_DNA-bd.
DR   Pfam; PF01388; ARID; 1.
DR   SMART; SM00501; BRIGHT; 1.
DR   SUPFAM; SSF46774; ARID; 1.
DR   PROSITE; PS51011; ARID; 1.
PE   4: Predicted;
KW   Collagen; Complete proteome; Nucleus.
SQ   SEQUENCE   1465 AA;  157761 MW;  71190DE87B9E6896 CRC64;
     MDFRLEYICG SENPNLYCMQ YQHNMYGQTN QSTGTYVCAP HPNGSQVMGS AFSATPMNRM
     PHHGAGGAPP PGGSVVFSPY VNGPSTIGET VVPPPPPHGQ YMGARYHLES PQQRGMYTVN
     GPAVGGEMIK QSPSPALTET FDGCTKDNSR MGLELDVGNR LTFLNFNTSG NGRGTPSSIE
     RQRMSNSSGS SRARSTCGAK ESHPPTPAPT SPGGTSSVHD DIESSLSSPS WPRTPLSPAV
     HRAQSSSHIS MGPPGSSNQQ QQVGCSTLLG PNSGNCSASG SGVGAIKEGQ ASMVARLLGD
     PSLINSTGEE WAERKSFFER LIQFSDSYGH PITSHPTVSK QTVDLHKLYM AVKARGGFEE
     VTKKKYWRDL CVIFNIGVSN SASGQLKKQY SRFLFPFECV YDLGGVDPQP ILASLESKKK
     NKNKSANVSG TDCSSSVGSS NSNYGQASSD GSFTKPYEQV TGSAGQFASQ PGAFPVHQAG
     PQQQQPPAPP QQQHGVKMSS ASMAPPPAQQ QQLQSTHLQH LQANTGGGIN SEMIARDPFD
     DRVQQQQQQH LQQTSSQQQH QQQQQQQQQQ QQQQQQQQQQ QQHQQQQQQQ SVLQQQTHHH
     HPHQQQSQPQ QGQPSQPTAV AGAYFQCGGA PPPFVGRQFS GYGPERPLYT PRLPLGSSAN
     CTVGPGYAGM KPYSAAGQPV SADSSGFSVS QVASLRPSSG NATTGSFVAE DSASHVATYQ
     SSPALKNTHI SVYPVYNNGD NSGSNNSINN NNDDNDNSSS SNNNNNNGVN DCPPSASGMA
     PTPVGPQQPS SMQMPAAYPT QEQQQPQPMM RSAAVGGGNV VLSNQPWMQS HVAASRMPAP
     SLQQPVSGTI PGLSLRETAP SASQQRSSKS SSQMQPHASG TNSGPIVTKD ATSPMGMKST
     VTSQRREKQQ HQQQGVVFPP ECIEATQVMS RRRKKYTAKD IGPIDPWRLV MVFKSGLLFE
     VTWGLNVLNV LLYDDSSAPY FNLNSLPGFL DALVEVWKQS LLELASEMGI QLPIGSVEKT
     MMMNNNTTKS SSRWKRRWYE DDDDEPRVDA ELLVGGDGDD GSGSGRGRGR GRGSGHSDDS
     SETMAKTADR MIEMGRPLPR SEDAGLEDID LKFCNCTCAT SRSVKKMMKM KKKILIETEK
     VDDGDEKQRR QQQEATTVVK QTRGGGTSSN GRTRRKRPLL AAVVTNGGGR GSGVATISGQ
     SNGLGGKRMA ISGQAAGGEQ QETTDVQGGV EQQPQEQQQQ HQTKSTASNN GSRCFCKTKF
     TLNTASLFSL PSWCPFEISS DRQRNALDRC LSASNALTGL SYIPGNEHRM AKCGTLVAGL
     ASLINLDLFE EIFKPLSFRR LTPDENIAED SEAMYEGGKC LQFNSFLDGL KRLVDDAFST
     VGAGHPRMRR SRSTVGRSAQ LGYGSIVDGA GPAVRNGVRA AKSVRSGGAL QNVRRRQERR
     LDASYTSVVA DRAFVETVSQ SFEHH
//
DBGET integrated database retrieval system