ID E5SR55_TRISP Unreviewed; 1465 AA.
AC E5SR55;
DT 08-MAR-2011, integrated into UniProtKB/TrEMBL.
DT 08-MAR-2011, sequence version 1.
DT 01-MAY-2013, entry version 12.
DE SubName: Full=Cuticle collagen rol-6;
GN ORFNames=Tsp_09548;
OS Trichinella spiralis (Trichina worm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Trichocephalida;
OC Trichinellidae; Trichinella.
OX NCBI_TaxID=6334;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS 195;
RX PubMed=21336279; DOI=10.1038/ng.769;
RA Mitreva M., Jasmer D.P., Zarlenga D.S., Wang Z., Abubucker S.,
RA Martin J., Taylor C.M., Yin Y., Fulton L., Minx P., Yang S.P.,
RA Warren W.C., Fulton R.S., Bhonagiri V., Zhang X., Hallsworth-Pepin K.,
RA Clifton S.W., McCarter J.P., Appleton J., Mardis E.R., Wilson R.K.;
RT "The draft genome of the parasitic nematode Trichinella spiralis.";
RL Nat. Genet. 43:228-235(2011).
CC -!- SIMILARITY: Contains 1 ARID domain.
CC -!- CAUTION: The sequence shown here is derived from an
CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is
CC preliminary data.
CC -----------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution-NoDerivs License
CC -----------------------------------------------------------------------
DR EMBL; ABIR02001550; EFV52694.1; -; Genomic_DNA.
DR RefSeq; XP_003371361.1; XM_003371313.1.
DR EnsemblMetazoa; EFV52694; EFV52694; EFV52694.
DR GeneID; 10903024; -.
DR KEGG; tsp:Tsp_09548; -.
DR CTD; 10903024; -.
DR KO; K11653; -.
DR GO; GO:0005581; C:collagen; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR Gene3D; 1.10.150.60; -; 1.
DR InterPro; IPR001606; ARID/BRIGHT_DNA-bd.
DR Pfam; PF01388; ARID; 1.
DR SMART; SM00501; BRIGHT; 1.
DR SUPFAM; SSF46774; ARID; 1.
DR PROSITE; PS51011; ARID; 1.
PE 4: Predicted;
KW Collagen; Complete proteome; Nucleus.
SQ SEQUENCE 1465 AA; 157761 MW; 71190DE87B9E6896 CRC64;
MDFRLEYICG SENPNLYCMQ YQHNMYGQTN QSTGTYVCAP HPNGSQVMGS AFSATPMNRM
PHHGAGGAPP PGGSVVFSPY VNGPSTIGET VVPPPPPHGQ YMGARYHLES PQQRGMYTVN
GPAVGGEMIK QSPSPALTET FDGCTKDNSR MGLELDVGNR LTFLNFNTSG NGRGTPSSIE
RQRMSNSSGS SRARSTCGAK ESHPPTPAPT SPGGTSSVHD DIESSLSSPS WPRTPLSPAV
HRAQSSSHIS MGPPGSSNQQ QQVGCSTLLG PNSGNCSASG SGVGAIKEGQ ASMVARLLGD
PSLINSTGEE WAERKSFFER LIQFSDSYGH PITSHPTVSK QTVDLHKLYM AVKARGGFEE
VTKKKYWRDL CVIFNIGVSN SASGQLKKQY SRFLFPFECV YDLGGVDPQP ILASLESKKK
NKNKSANVSG TDCSSSVGSS NSNYGQASSD GSFTKPYEQV TGSAGQFASQ PGAFPVHQAG
PQQQQPPAPP QQQHGVKMSS ASMAPPPAQQ QQLQSTHLQH LQANTGGGIN SEMIARDPFD
DRVQQQQQQH LQQTSSQQQH QQQQQQQQQQ QQQQQQQQQQ QQHQQQQQQQ SVLQQQTHHH
HPHQQQSQPQ QGQPSQPTAV AGAYFQCGGA PPPFVGRQFS GYGPERPLYT PRLPLGSSAN
CTVGPGYAGM KPYSAAGQPV SADSSGFSVS QVASLRPSSG NATTGSFVAE DSASHVATYQ
SSPALKNTHI SVYPVYNNGD NSGSNNSINN NNDDNDNSSS SNNNNNNGVN DCPPSASGMA
PTPVGPQQPS SMQMPAAYPT QEQQQPQPMM RSAAVGGGNV VLSNQPWMQS HVAASRMPAP
SLQQPVSGTI PGLSLRETAP SASQQRSSKS SSQMQPHASG TNSGPIVTKD ATSPMGMKST
VTSQRREKQQ HQQQGVVFPP ECIEATQVMS RRRKKYTAKD IGPIDPWRLV MVFKSGLLFE
VTWGLNVLNV LLYDDSSAPY FNLNSLPGFL DALVEVWKQS LLELASEMGI QLPIGSVEKT
MMMNNNTTKS SSRWKRRWYE DDDDEPRVDA ELLVGGDGDD GSGSGRGRGR GRGSGHSDDS
SETMAKTADR MIEMGRPLPR SEDAGLEDID LKFCNCTCAT SRSVKKMMKM KKKILIETEK
VDDGDEKQRR QQQEATTVVK QTRGGGTSSN GRTRRKRPLL AAVVTNGGGR GSGVATISGQ
SNGLGGKRMA ISGQAAGGEQ QETTDVQGGV EQQPQEQQQQ HQTKSTASNN GSRCFCKTKF
TLNTASLFSL PSWCPFEISS DRQRNALDRC LSASNALTGL SYIPGNEHRM AKCGTLVAGL
ASLINLDLFE EIFKPLSFRR LTPDENIAED SEAMYEGGKC LQFNSFLDGL KRLVDDAFST
VGAGHPRMRR SRSTVGRSAQ LGYGSIVDGA GPAVRNGVRA AKSVRSGGAL QNVRRRQERR
LDASYTSVVA DRAFVETVSQ SFEHH
//