ID A0A151NAK6_ALLMI Unreviewed; 1640 AA.
AC A0A151NAK6;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0000259|PROSITE:PS51403};
GN ORFNames=Y1Q_0024496 {ECO:0000313|EMBL:KYO33868.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO33868.1};
RN [1] {ECO:0000313|EMBL:KYO33868.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO33868.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO33868.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03003627; KYO33868.1; -; Genomic_DNA.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023:SF1019; COLLAGEN; 1.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 18.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000050525};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1422..1640
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 30..442
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 463..868
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 884..1400
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 40..54
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 396..415
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 548..571
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1038..1052
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1640 AA; 159365 MW; A4A86F8ADF30408C CRC64;
MPLTLNCVCK AKSSCYCDGT KGNKGEKGFP GAQGPPGIIG FPGPEGPPGP QGPKGSPGLM
GFHGHKGVRG KPGLPGFSGS PGLPGEPGKA GPPGQPGIPG CDGIKGEQGF PGLPGRKGAP
GRPGVDGIKG EKGCPPSVYD EGDDAKGSPG MPGIPGVQGD PGPRGLQGPV GPQGSPGIPG
IRGRVGPPGP KGLMGAKVIG TKGQKGDTGL TGPPGPPGTV IVTLSQPNNM TGLKGEKGEK
GSRGLPGQVG SPGLPGDSQS EQGDLGEPGP QGKPGKDGHP GPPGFQQGEK GEQGEPGIAG
GRGPKGMKGN IGPPGLCGPT EYYDLQPGEK GDEGPPGLPG PKGQPGAPGS RGHPGGTGRP
GISSPGPKGP YGYPGPKGKK GEQGQPGINV VGPPGLPGLP GTPGSVGPPG PPGQPGEVMF
ERGPPGIPGN PGYMGPPGLP GPRGMKGEDI CACTDCTYIP GPSGPSGAPG PDGAPGNRGG
EGLPGPKGSQ GSPGTQGPPG LQGYPGLPGD QGTKGPKGKP GHIYPEGQQG DQGNPGIRGY
PGKKGADGFH GRPGRKGSKG SKGEEAQRGQ KGDRGLPGPS GSPGPSGDIG LPGYLGYGPP
GITGSKGIRG TPGLPGLPGG NGPKGEPGTI NRIPGPPGLR GPDGSPGPPG AKGHLGSRGQ
RGNPGFPGPK GDQGFPGIGF PGNLGPKGSK GLPGSKGSPG CSKNGEAGVP GSPGFPGEKG
DPGSSIPGEP GRLGTPGTNG LPGTKGENGL LGFPGVPGHP GREGTVGLPG EPGFPGAPGD
DGPPGKPGDC HNGSPGPTGR QGAPGRQGDR GILGAKGDRG ISGMSYPGFP GEPGFPGLPG
PKGAVGSPGL KGLTGRPGNP GTVGQTGEPG VMGDLGMRGP PGIFGNPGTP GKRGSHGLPG
PKGAKGSFGE KGKKGDKGFL GPVLSRVSAG DKGEPGSKGS PGRTGLRGVK GSEGVQGSAG
DPGTRGIPAP PGSKGLPGLP GFHGKQGLPG APGERGDIGM PGSKGESGSP GSPGPKGFIG
LPGIDGVLGD KGEPSYPEFG PPGRPGPKGG PGFPGIPGTK GKRGLQGQPG PVGDPGLPGS
KGDFGEPGTP GLPGHAGEPG AKGYKGKPGE RGTGGLPGAV GSPGADGPVG ERGSQGHDGF
PGSPGEKGEL GALGLGVPGP QGPLGPKGSK GGRGLPGFPG LPGAKGLMGE QGPAGPSGLN
GPPGLPGVPG EIITGQKGNK GIAGIDGKPG LPGMPGPPGL TIYRRGDGGD RGADGIPGTP
GPVGDTGSPG PIGINGVPGH PGPAGDVGFP GFPGVKGEKG NQGPVGPTGK PGPRGPKGPP
GHPGMNRKRM CTPGERGPPG SPGNPGIPGD KGHQGVPGLP GCQGVKGRPG SHGSPGLPGP
LGPKGDKGFK GDTGQPGLIG FPGLPGIPGY PGAFIPSPAR RGFIFTRHSQ SIMMPSCPSG
TSHIYSGYSL LSVQGNEQAH GQDLGTAGSC LQRFTTVPFL FCNTNKICNF ASRNDYSYWL
STAATMPPDM APVSGRALEP YISRCIVCEG PAMVIAIHSQ TTAVPLCPDG WISLWKGFSF
VMYTGAGSEA SGQALASPGS CLEEFRAVPF IECHSRGTCN YYANSYSFWL ASLDPRRMFR
KPLPQTMKSG ELENIISRCQ
//