GenomeNet

Database: UniProt
Entry: A0A151NAK6_ALLMI
LinkDB: A0A151NAK6_ALLMI
Original site: A0A151NAK6_ALLMI 
ID   A0A151NAK6_ALLMI        Unreviewed;      1640 AA.
AC   A0A151NAK6;
DT   08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT   08-JUN-2016, sequence version 1.
DT   27-MAR-2024, entry version 23.
DE   RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0000259|PROSITE:PS51403};
GN   ORFNames=Y1Q_0024496 {ECO:0000313|EMBL:KYO33868.1};
OS   Alligator mississippiensis (American alligator).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC   Alligator.
OX   NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO33868.1};
RN   [1] {ECO:0000313|EMBL:KYO33868.1, ECO:0000313|Proteomes:UP000050525}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO33868.1};
RX   PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA   St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA   Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA   Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA   Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA   Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA   McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA   Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA   Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA   Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA   Ray D.A.;
RT   "Sequencing three crocodilian genomes to illuminate the evolution of
RT   archosaurs and amniotes.";
RL   Genome Biol. 13:415-415(2012).
CC   -!- FUNCTION: Type IV collagen is the major structural component of
CC       glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC       together with laminins, proteoglycans and entactin/nidogen.
CC       {ECO:0000256|ARBA:ARBA00003696}.
CC   -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC       Secreted, extracellular space, extracellular matrix, basement membrane
CC       {ECO:0000256|ARBA:ARBA00004302}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KYO33868.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AKHW03003627; KYO33868.1; -; Genomic_DNA.
DR   Proteomes; UP000050525; Unassembled WGS sequence.
DR   GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR001442; Collagen_IV_NC.
DR   InterPro; IPR036954; Collagen_IV_NC_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   PANTHER; PTHR24023:SF1019; COLLAGEN; 1.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   Pfam; PF01413; C4; 2.
DR   Pfam; PF01391; Collagen; 18.
DR   SMART; SM00111; C4; 2.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   PROSITE; PS51403; NC1_IV; 1.
PE   4: Predicted;
KW   Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000050525};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT   DOMAIN          1422..1640
FT                   /note="Collagen IV NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51403"
FT   REGION          30..442
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          463..868
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          884..1400
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        40..54
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        396..415
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        548..571
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1038..1052
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1640 AA;  159365 MW;  A4A86F8ADF30408C CRC64;
     MPLTLNCVCK AKSSCYCDGT KGNKGEKGFP GAQGPPGIIG FPGPEGPPGP QGPKGSPGLM
     GFHGHKGVRG KPGLPGFSGS PGLPGEPGKA GPPGQPGIPG CDGIKGEQGF PGLPGRKGAP
     GRPGVDGIKG EKGCPPSVYD EGDDAKGSPG MPGIPGVQGD PGPRGLQGPV GPQGSPGIPG
     IRGRVGPPGP KGLMGAKVIG TKGQKGDTGL TGPPGPPGTV IVTLSQPNNM TGLKGEKGEK
     GSRGLPGQVG SPGLPGDSQS EQGDLGEPGP QGKPGKDGHP GPPGFQQGEK GEQGEPGIAG
     GRGPKGMKGN IGPPGLCGPT EYYDLQPGEK GDEGPPGLPG PKGQPGAPGS RGHPGGTGRP
     GISSPGPKGP YGYPGPKGKK GEQGQPGINV VGPPGLPGLP GTPGSVGPPG PPGQPGEVMF
     ERGPPGIPGN PGYMGPPGLP GPRGMKGEDI CACTDCTYIP GPSGPSGAPG PDGAPGNRGG
     EGLPGPKGSQ GSPGTQGPPG LQGYPGLPGD QGTKGPKGKP GHIYPEGQQG DQGNPGIRGY
     PGKKGADGFH GRPGRKGSKG SKGEEAQRGQ KGDRGLPGPS GSPGPSGDIG LPGYLGYGPP
     GITGSKGIRG TPGLPGLPGG NGPKGEPGTI NRIPGPPGLR GPDGSPGPPG AKGHLGSRGQ
     RGNPGFPGPK GDQGFPGIGF PGNLGPKGSK GLPGSKGSPG CSKNGEAGVP GSPGFPGEKG
     DPGSSIPGEP GRLGTPGTNG LPGTKGENGL LGFPGVPGHP GREGTVGLPG EPGFPGAPGD
     DGPPGKPGDC HNGSPGPTGR QGAPGRQGDR GILGAKGDRG ISGMSYPGFP GEPGFPGLPG
     PKGAVGSPGL KGLTGRPGNP GTVGQTGEPG VMGDLGMRGP PGIFGNPGTP GKRGSHGLPG
     PKGAKGSFGE KGKKGDKGFL GPVLSRVSAG DKGEPGSKGS PGRTGLRGVK GSEGVQGSAG
     DPGTRGIPAP PGSKGLPGLP GFHGKQGLPG APGERGDIGM PGSKGESGSP GSPGPKGFIG
     LPGIDGVLGD KGEPSYPEFG PPGRPGPKGG PGFPGIPGTK GKRGLQGQPG PVGDPGLPGS
     KGDFGEPGTP GLPGHAGEPG AKGYKGKPGE RGTGGLPGAV GSPGADGPVG ERGSQGHDGF
     PGSPGEKGEL GALGLGVPGP QGPLGPKGSK GGRGLPGFPG LPGAKGLMGE QGPAGPSGLN
     GPPGLPGVPG EIITGQKGNK GIAGIDGKPG LPGMPGPPGL TIYRRGDGGD RGADGIPGTP
     GPVGDTGSPG PIGINGVPGH PGPAGDVGFP GFPGVKGEKG NQGPVGPTGK PGPRGPKGPP
     GHPGMNRKRM CTPGERGPPG SPGNPGIPGD KGHQGVPGLP GCQGVKGRPG SHGSPGLPGP
     LGPKGDKGFK GDTGQPGLIG FPGLPGIPGY PGAFIPSPAR RGFIFTRHSQ SIMMPSCPSG
     TSHIYSGYSL LSVQGNEQAH GQDLGTAGSC LQRFTTVPFL FCNTNKICNF ASRNDYSYWL
     STAATMPPDM APVSGRALEP YISRCIVCEG PAMVIAIHSQ TTAVPLCPDG WISLWKGFSF
     VMYTGAGSEA SGQALASPGS CLEEFRAVPF IECHSRGTCN YYANSYSFWL ASLDPRRMFR
     KPLPQTMKSG ELENIISRCQ
//
DBGET integrated database retrieval system