ID S4VTD6_9VIRU Unreviewed; 406 AA.
AC S4VTD6;
DT 16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2013, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Collagen triple helix domain containing protein {ECO:0000313|EMBL:AGO82680.1};
GN ORFNames=pdul_cds_565 {ECO:0000313|EMBL:AGO82680.1};
OS Pandoravirus dulcis.
OC Viruses; Pandoravirus.
OX NCBI_TaxID=1349409 {ECO:0000313|EMBL:AGO82680.1, ECO:0000313|Proteomes:UP000201566};
RN [1] {ECO:0000313|EMBL:AGO82680.1, ECO:0000313|Proteomes:UP000201566}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Melbourne {ECO:0000313|EMBL:AGO82680.1};
RX PubMed=23869018; DOI=10.1126/science.1239181;
RA Philippe N., Legendre M., Doutre G., Coute Y., Poirot O., Lescot M.,
RA Arslan D., Seltzer V., Bertaux L., Bruley C., Garin J., Claverie J.M.,
RA Abergel C.;
RT "Pandoraviruses: amoeba viruses with genomes up to 2.5 Mb reaching that of
RT parasitic eukaryotes.";
RL Science 341:281-286(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KC977570; AGO82680.1; -; Genomic_DNA.
DR RefSeq; YP_008319349.1; NC_021858.1.
DR GeneID; 16512207; -.
DR KEGG; vg:16512207; -.
DR Proteomes; UP000201566; Genome.
DR InterPro; IPR008160; Collagen.
DR PANTHER; PTHR24637; COLLAGEN; 1.
DR PANTHER; PTHR24637:SF428; SCAVENGER RECEPTOR CLASS A MEMBER 3; 1.
DR Pfam; PF01391; Collagen; 2.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:AGO82680.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000201566}.
FT REGION 39..147
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 236..263
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 406 AA; 37685 MW; 626DACFA0DA9E384 CRC64;
MAHTQSTAPV GVGATRVMQV YVRLPRPVPP RGAVRILSAP TAARHGRQGP PGPVGPPGKR
GVAGPSGPRG PAGEAGLNGA PGADGAQGPP GTRGAAGGQG PRGPDGPPGD AGPAGEMGPP
GEQGPAGPTG DQGPPGEVGP PGPLGGLLFG AGTESVSISG TVDLAADVHY LDLVVPVGAR
LRTHGYRVFV SGTLNLDGTI DNDGSINGVA ADQGTVGGGG AGGAALTDGQ SLVHAFGGSG
GDGGPATVPG GAPGDGGTTV LPTAAQGGTG LLNNPLALVQ GRTVDGFLLQ GGAGGGGGGT
SAAFPALAPA VGGGGAGVVV VAAREVIGTG LLRARGGSGS VGEFTGTSAA GSSGGGGGGL
VVLVANTVAS TITFNVEGGP GGINPFAPEA ASGQPGRAVL VRVASE
//