ID A0A091GD43_9AVES Unreviewed; 1619 AA.
AC A0A091GD43;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE SubName: Full=Collagen alpha-2(IV) chain {ECO:0000313|EMBL:KFO79868.1};
DE Flags: Fragment;
GN ORFNames=N303_02419 {ECO:0000313|EMBL:KFO79868.1};
OS Cuculus canorus (common cuckoo).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Cuculiformes; Cuculidae; Cuculus.
OX NCBI_TaxID=55661 {ECO:0000313|EMBL:KFO79868.1, ECO:0000313|Proteomes:UP000053760};
RN [1] {ECO:0000313|EMBL:KFO79868.1, ECO:0000313|Proteomes:UP000053760}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_N303 {ECO:0000313|EMBL:KFO79868.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL448071; KFO79868.1; -; Genomic_DNA.
DR STRING; 55661.A0A091GD43; -.
DR Proteomes; UP000053760; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023:SF1019; COLLAGEN; 1.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 13.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:KFO79868.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000053760};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1400..1619
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 1..185
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 210..257
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 269..543
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 569..751
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 767..886
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 903..949
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 981..1394
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 146..160
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 365..384
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 489..506
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KFO79868.1"
FT NON_TER 1619
FT /evidence="ECO:0000313|EMBL:KFO79868.1"
SQ SEQUENCE 1619 AA; 159118 MW; F0ABDBE20610D3DE CRC64;
QGQPGELGPQ GPIGSLGFTG PPGLPGEKGQ RGENGQPGPA GEKGDKGPTG VPGFPGLDGV
PGLPGSGGPR GNRGLDGCNG SRGDPGFPGE NGYIGSRGPY GNPGQKGEKG NSVYVTHFGR
GPPGDSGDPG LPGMPGPRGS RGAPGPSGYP GPPGLPGIPG YPGLPGKQGN PGIGVDGQKG
EPGDIGLPGP PGSPLLVGPP GAQLFKGEKG QKGLPGVTGH RGPRGPKGVL GRGEKGEKGI
PRSPGLQGHP GSYGVAGFPG MKGEMGFAGF PGKPGYPGIQ GDPGEEGPPG PPGAVGTLLL
PIKGPQGDPG FPGPAGDVGS VGPTGPAGFL GRPGDDGTSL PGLPGVSGPP GPQGFQGDPG
FPGTGEPIPG SPGFPGPPGL PGQPGRQGLP GLPRVICTDR GIPGEPGAQG QMGLPGRKGE
KGEKGNQGLC SCVAGPPGPR GVQGPPGTQG KKGQMGYPGR RGEKGDSGLS GAVGSPGLPG
TPGSAGQHGE KGEKGDPGRI RIKGIKGERG PAGVPGFPGQ RGNDGRDGEV GLPGEKGAEG
DSGVPLPGDT GFPGVPGLPG IKGQMGLPGL GFPGPPGVRG SPGDFGDTGS VGPPGPKGQK
GDTVCITLPY PGNPGPPGFK GVQGPKGLKG LPGHPGPNGF DGQKGHQGRP GTGIPGPEGF
RGQPGDPGDE GERGDTVDGK NGPPGPPGID GQKGVPGDTT YGPPGIPGNR GLPGPPGAQG
ARGEPVFFEP GTPGFPGAKG FRGPEGDVGA PGCPGFPGLP CDTALPGPPG LRGVMGMPGP
QGLPGFKGQR GDRGLAGSPG IKGLKGSRGS QGPPGPPGSR GFPGLPGNKG PPGLPGQTGS
KGIQGPQGFP GLPGTQGPMG ITGVKGEEGK MGPPGPSGEC GDMGIKGERG LPGDSGWVNI
RLEKGQQGEP GFPGENGSRG ERGEKGNTGF RGTPGLPGKN GAPGMQGDHG DTGVMGFPGP
RGFPGPKGFR GILGFQGQPG DQGDTGLPGI PGNPGLTGRK GSKGRRGDVT ALLGAHGQRG
PPGDPGLPGL CGFPGEKGSH GIQGQPGGPG SKGDTGYPGI PGLPGATGPQ GLPGEHGEEG
KHGISGPPGL QGLPGSQGRK GLPGLPGLDG LDGLKGQKGS AGAPGQSDTG APGYPGELGP
KGDRGEPGWP GISIPGPHGE RGFPGYPGRR GPVGPTGPMG RSPDSASPGH PGDQGPPGLD
GMRGHPGNPG PPGETIFVRG DPGDIGNRGA PGNPGLRGQQ GARGPPGNQG RRGPKGPMGI
HGPQGPPGAI GQPGDQGFQG KPGRRGPTGT YEAVVHCDPG EPGKTDDSCP TIPGPPGDAG
QRGDDGSVGL PGPIGHPGPQ GRKGEEGSCG LPGQDGLPGP PGPPGDQGNR GEQGFAGPQG
PPGQTGIPGP PGPHIRSASG FLLVLHSQLC PQGMPKLWTG YSLLYLEGQE KAHNQDLGLA
GSCLPVFNTM PFAYCNINQV CYYASRNDKS YWLSSAAPLP MAPLSEEEIQ PYISRCAVCE
APAQVVAVHS QDQSIPPCPV NWRSLWIGYS FLMHTGSGDQ GGGQSLMSPG SCLEDFRSAP
FIECQGQRGT CQFFANEYSF WLTTVMPELQ FASAPLSGTL KEGQEQRKKI SRCQVCLKH
//