ID A0A0V0Z8W9_9BILA Unreviewed; 1800 AA.
AC A0A0V0Z8W9;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE SubName: Full=Collagen alpha-1(IV) chain {ECO:0000313|EMBL:KRY09026.1};
GN Name=emb-9 {ECO:0000313|EMBL:KRY09026.1};
GN ORFNames=T12_10677 {ECO:0000313|EMBL:KRY09026.1};
OS Trichinella patagoniensis.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichinellidae; Trichinella.
OX NCBI_TaxID=990121 {ECO:0000313|EMBL:KRY09026.1, ECO:0000313|Proteomes:UP000054783};
RN [1] {ECO:0000313|EMBL:KRY09026.1, ECO:0000313|Proteomes:UP000054783}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS2496 {ECO:0000313|EMBL:KRY09026.1};
RA Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT "Evolution of Trichinella species and genotypes.";
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRY09026.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JYDQ01000296; KRY09026.1; -; Genomic_DNA.
DR Proteomes; UP000054783; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1104; COLLAGEN ALPHA-1(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 17.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:KRY09026.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000054783};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1573..1797
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 68..112
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 141..192
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 283..454
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 506..557
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 584..759
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 865..1138
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1170..1191
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1276..1340
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1364..1556
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1800 AA; 175071 MW; 5FAA084F997DC2E1 CRC64;
MFIDVKIATV CMSVMRILTT VVVSLLVFFN RQQFTNAYAP PDCKGCAPPC ICPGLKGERG
DVGFMGLPGH PGEPGDPGDE GPEGMQGQVG QQGEPGPVGL KGYRGAQGEP GLRGLPGIPG
LAGFDGPMGP PGIPGCNGTD GRMGAPGLPG LPGSQGPPGM EGAQGPRGDT GEGGINSAGI
KGERGESGKP GTQATKFYYA QECDNNFNKQ YGRPGMPGRD GVKGEKGDIG YVGYPGPPGP
PGPKGAMGLF RAGQKGEKGA AGLPGSPGPP GQLTALRLGE MDILQGPEGN PGPKGEKGDY
GTVGPPGMPG AVGEPGYPGL KGMKGEPGPP GALGKRGKDG TRGAAGEKGS TGDAGGPGRP
GRPGLKGEMG DEGTPGRRGP QGDPGPPGLP GVGRGLKGAR GIEGPDGAPG PKGLPGKDGL
PGLSGKRGPI GLPGPPGEVG VDGRPGYSEK GMKGNDGTII PLKICLHRYP GLRGEPGYPG
MPGLQGPPGE IGFPGSTIYG PPGRDGYPGL DGIPGPPGER GDPGKPGLKG MPGTGKPLVG
PPGQIGFPGL QGDSGRVGVP GRVGAPGLKG LPGDDCGKCP DGTPGVKGTR GDAGLPGYAG
IRGPEGSPGL PGPKGKTGFP GRDGEPGLQG YRGAPGYPGR PGDKGDAGDA IGTPNAGRPG
AKGEPGIPGF PGQRGPMGDA GRDGSPGLPG APGLPGERGL PGSRAKNGRM GKAGFPGPKG
EPGDSFPGPP GIPGMKGIRG DPGIPGFAGP QGAAGPAAPD MILKGEPGLP GRPGIPGQKG
EFGYPGLPGE RGDPGYAGQP GPAGMIGETG PSIPGIPGPK GYPGMKGSNG LPGFPGIPGL
PGKDGIPGVS GTKGVKGDAI VGPIGPAGPQ GSRGVPGEPG VPGLPGQDGA EGDIGQKGYP
GLKGNAGSAG QPGLMGLKGA QGLDGLPGLP GSDGRPGIVG SPGEPGRPGL RGNDGIPGAP
GKRGPIGSAG VPGAAGLPGM AGTPGLKGES GEPGFPGAQG PQGDSGVPGR PGIDGQKGES
GLPGIMGPKG MVGEVGLAGL PGRSGLAGRK GEQGDHGYAG TKGTRGDPGY VAAQGPPGDM
GDPGEVGPAG NPGVAGLQGV RGEKGAPGDS FPGASGNAGE RGDPGLPGLA AKPGFPGPPG
DAGVVGMKGI PGNAGTPGLP GMPGQPGIPG SKGERGNAGI PGVRGSDGAK GEDGLPGQKG
YPGLAGNNGL EGMPGLPGMP GVKGEAGLSG IPGLPGVGGL KGNKGESGLP GYPGPKGDIG
VPGKAGYPGV KGEVGIGGIP GKKGRDGIPG IPGRKGDAGN PGLPGSPGFG VKGDPGFPGL
PGMEGTGSPG AKGERGEAGL PGMMGSAGAS GDPGYPGQIG EKGVPGIPGK RGKKGASGLP
GPRGDAGYPG MKGQGGRPGA EGRPGDFGVQ GQPGPAGDAG FPGRKGESGV PGLVGVPGFP
GAKGDRGDPG SFGLPGFPGL KGDTGDPGPV GPMHPYSEQR HSPPGMRGRP GEPGIQGPAG
LPGSRGMPGP PGVPGARGDK GLAGTPGRPG VPGPKGSSGN FGNAGFPGNV GPPGNPGFPG
VPGIRGGIAP SRGFYFARHS QTTAVPNCPA GTTPMWTGYS LLYIQGDGKS SGQDLGLPGS
CLRKFSTMPF MPCNLNNECH IASRSDYSYW LATEEPMTAS MAPVSGFGIR PYISRCVVCE
LPTQVVALHS QTNDIPRCPR GWTGLWTGYS FIMHTAAGAE GTGQNLQSPG SCLESFRTLP
FIECHGRGTC NHYATNHAFW LAVIDRDMMF KKPYSETLKA GGLKQRVSRC QVCMRNPPVY
//