ID E3VY58_MIMIV Unreviewed; 945 AA.
AC E3VY58; A0A0G2Y4H4;
DT 11-JAN-2011, integrated into UniProtKB/TrEMBL.
DT 11-JAN-2011, sequence version 1.
DT 27-MAR-2024, entry version 42.
DE SubName: Full=Collagen-like protein 1 {ECO:0000313|EMBL:AKI80725.1};
GN Name=L71 {ECO:0000313|EMBL:ADO18364.1};
OS Acanthamoeba polyphaga mimivirus (APMV).
OC Viruses; Varidnaviria; Bamfordvirae; Nucleocytoviricota; Megaviricetes;
OC Imitervirales; Mimiviridae; Megamimivirinae; Mimivirus;
OC Mimivirus bradfordmassiliense.
OX NCBI_TaxID=212035 {ECO:0000313|EMBL:AKI80725.1, ECO:0000313|Proteomes:UP000274448};
OH NCBI_TaxID=5757; Acanthamoeba polyphaga (Amoeba).
RN [1] {ECO:0000313|EMBL:ADO18364.1, ECO:0000313|Proteomes:UP000201519}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21375749;
RA Legendre M., Santini S., Rico A., Abergel C., Claverie J.M.;
RT "Breaking the 1000-gene barrier for Mimivirus using ultra-deep genome and
RT transcriptome sequencing.";
RL Virol. J. 8:99-99(2011).
RN [2] {ECO:0000313|EMBL:AKI80725.1, ECO:0000313|Proteomes:UP000274448}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Amazonia {ECO:0000313|EMBL:AKI80725.1,
RC ECO:0000313|Proteomes:UP000274448};
RA Assis F.L., Abrahao J.S., Kroon E.G., Dornas F.P., Andrade K.R.,
RA Borato P.V.M., Pilotto M.R., Benamar S., LaScola B., Colson P.;
RT "Pan-genome analysis of Brazilian lineage A amoebal mimiviruses.";
RL Submitted (OCT-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: May participate in the formation of a layer of cross-linked
CC glycosylated fibrils at the viral surface thus giving it a hairy-like
CC appearance. {ECO:0000256|ARBA:ARBA00003026}.
CC -!- SUBCELLULAR LOCATION: Virion {ECO:0000256|ARBA:ARBA00004328}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; HQ336222; ADO18364.1; -; Genomic_DNA.
DR EMBL; KM982403; AKI80725.1; -; Genomic_DNA.
DR RefSeq; YP_003986560.1; NC_014649.1.
DR GeneID; 9924664; -.
DR KEGG; vg:9924664; -.
DR OrthoDB; 33298at10239; -.
DR Proteomes; UP000201519; Genome.
DR Proteomes; UP000274448; Genome.
DR GO; GO:0044423; C:virion component; IEA:UniProtKB-KW.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR021210; Exosporium_BclB.
DR NCBIfam; TIGR03721; exospore_TM; 1.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF01391; Collagen; 11.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:AKI80725.1};
KW Virion {ECO:0000256|ARBA:ARBA00022844}.
FT REGION 80..226
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 257..441
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 488..712
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 733..768
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 117..209
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 257..440
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 488..686
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 737..757
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 945 AA; 93884 MW; E6432D6EDB7A7BC8 CRC64;
MSRITCPITD CKCKCNKNNC VYCVMGRQGL PGPKGSSGNS IYVGTGVPSP FLGNNGDLYI
DSSTGLLYAK VNGVWVPQGS LKGDPGASGS KGEKGDKGSS GEAGLKGEQG TKGEQGDQGE
QGDKGDKGDK GDVGAKGDQG DKGDQGDVGA KGDQGDKGDQ GDVGAKGDQG DKGDKGDQGD
KGDVGDPGVK GDKGDTGDKG DKGDKGDKGQ NGSEILFGLG IPSPDLGEDG DVYIDTLTGN
VYQKIGGVWV LETNIKGEKG DQGDKGDTGS KGDQGDKGDQ GDKGDQGDKG DVGDKGNKGD
TGSKGDVGDK GDVGDKGDKG DTGDKGDKGD TGDKGDKGDV GDKGDKGDVG DKGDVGDKGD
VGDKGDKGDT GDKGDKGDIG DKGDKGDIGD KGDKGDIGDK GDKGDVGDKG DKGDKGDIGD
KGDKGDIGDK GDKGDKGDKG ENGSGILFGL GIPSPDLGED GDIYIDTLTG NVYQKIGGVW
VLETSIKGEK GDKGDTGDKG DTGDKGDTGD KGDTGDKGDT GDKGDVGDKG DVGDKGDVGD
KGDVGDKGDK GDIGDKGDKG DLGDKGDKGD VGDKGDVGDK GDKGDIGDKG DKGDLGDKGD
KGDVGDKGDK GDVGDKGDKG DIGDKGDKGD VGDKGDKGDI GDKGDKGDKG DVGSKGDKGD
KGDVGDKGDK GDVGSKGDKG DKGDKGDVGP VGASILFGAG VPSPTTGENG DSYIDNSTGV
FYLKINDVWV PQTNIKGDKG DKGDKGDKGD KGDTGDVGLK GDTGTPGSGP IIPYSSGLTP
VALAVVAVAG GGIADTGASY DFGVSSPSVT LVGVNLDFTG PVQGLLPNMA WSAPRDTVIT
SLATAFQVSV AISAVLEPIF LRTQVYRELA ANPGVFEPLA GAIVEFDVAS SALISVGTVF
RGIVTGLSIP VNAGDRLIVF ANTRTTSLIS VGTVTGFISS GLALA
//