ID A0A091GSL8_BUCRH Unreviewed; 1667 AA.
AC A0A091GSL8;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 24-JAN-2024, entry version 29.
DE SubName: Full=Collagen alpha-2(IV) chain {ECO:0000313|EMBL:KFO86164.1};
DE Flags: Fragment;
GN ORFNames=N320_04181 {ECO:0000313|EMBL:KFO86164.1};
OS Buceros rhinoceros silvestris.
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Bucerotiformes; Bucerotidae; Buceros.
OX NCBI_TaxID=175836 {ECO:0000313|EMBL:KFO86164.1, ECO:0000313|Proteomes:UP000054064};
RN [1] {ECO:0000313|EMBL:KFO86164.1, ECO:0000313|Proteomes:UP000054064}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_N320 {ECO:0000313|EMBL:KFO86164.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL510099; KFO86164.1; -; Genomic_DNA.
DR Proteomes; UP000054064; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF714; COLLAGEN ALPHA-4(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 13.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:KFO86164.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000054064};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1443..1667
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 38..232
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 246..795
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 815..939
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1007..1440
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 323..337
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 526..545
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 858..873
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1094..1114
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KFO86164.1"
FT NON_TER 1667
FT /evidence="ECO:0000313|EMBL:KFO86164.1"
SQ SEQUENCE 1667 AA; 163093 MW; D99C30144C00B1C5 CRC64;
WLLVVFSAQD VDGGGYAYLD PCGGQDCSVC RCFPEKGSRG QPGELGAQGP IGSLGSTGPA
GLPGEKGRRG EDGQPGPAGE KGDKGPTGVP GFPGLDGVPG LPGREGPRGK PGLDGCNGSR
GDPGFPGEIG YTGPRGPYVR EGRQKGEKGN SVYVSHFGKG PPGDRGDPGP PGMPGPRGSR
GTMGPSGYPG QPGLPGIPGY PGLPGEQGSP GIGVDGQKGE PGDIGLPGPP GSPLLVGPPG
AQLFKGEKGQ KGLPGLTGHR GPRGPKGELG RGEKGEKGVA GFPGLQGAPG SYGRTGFPGL
KGETGFAGFP GQAGYPGVQG DPGERGPPGP PGAVGTPLHP IKGPQGDPGF PGPAGDMGSV
GPAGPAGLIG SPGDDGTSLP GLPGVSGAPG PRGFPGDPGF PGTGESIAGR PGFPGPPGLP
GQPGRQGLPG LPSVICTDRG IPGEAGAKGQ MGLPGRKGEK GEKGNPGPCS CTAGPPGPRG
VQGSPGAPGR KGHMGYPGSH GEKGDPGLAG AVGSPGLPGR PGSAGQHGEK GEKGDPGRVR
TKGMKGERGP AGAQGSPGQR GNDGRDGELG LPGQKGAEGD CGVALPGDKG FPGVPGLPGV
QGQIGLPGLG FPGPPGVRGS PGDTGDTGSA GPPGPKGQKG ETICIPSPRP GSPGPPGFKG
VQGPKGLKGF PGRPGPHGFD GQKGLRGRPG AGIPGPEGFR GDAGDPGDEG ERGPSVDGSH
GPPGPPGIDG QKGVRGDTTY GPPGIPGAPG LPGPPGAQGA RGDPGVPGLQ GQLGTPGFPG
AKGFRGPEGD RGAPGCPGFP GLPCIAGLPG PPGLRGATGL PGPQGLPGFK GQRGDRGLAG
IPGIKGLKGS HGSQGPPGPP GFRGPPGLPG NRGPPGFPGQ TGSKGIPGPQ GFPGLPGTQG
PMGISGVKGE EGNMGPPGPG GECGDTGLRG ERGPPGDSGC INIRLEKGQK GEPGFPGEDG
FIGERGEKGS TGFRGAPGLP GKNGVPGLPG DHGDTGLMGF PGLRGFPGPR GSKGMMGFQG
QSGDQGDVGL PGIPGEAGRA GPRGPKGERG DPTPLLGTRG RKGPPGDPGL PGLCGLPGEK
GSPGIQGEPG RPGSKGDPGP PGIPGFPGAP GPQGLPGEPG EKGKHGILGP PGLQGLPGSH
GRKGLPGLPG LDGLDGLKGQ KGSAGAPGQS ETGPPGHPGE PGPKGDRGEP GWPGVSIPGP
PGERGFPGFP GRRGPVGPTG PMGRSPDSAS PGPPGVQGPP GLDGIRGHPG NPGPPGETIF
VRGDPGDTGI RGAPGPPGQR GQQGARGVPG NLGRTGPKGP MGIHGPQGPL GAVGQPGDEG
FQGIPGPRGP PGTEEPVAHC DPGEPGKRDD SCPAIPGPPG DAGPRGEDGS AGSPGPIGHP
GPHGRKGEEG SCGLPGPHGS PGAPGPPGDQ GDRGEQGHVG PQGPPGQTGI PGPPGPQVRS
ASGFLLVLHS QSDREPLCPQ GMPKLWTGYS LLYLEGQEKA HNQDLGLAGS CLPVFNTMPF
AYCNINQVCY YASRNDKSYW LSSAAPLPMT PLSEEEIQPY ISRCAVCEAP AQAVAVHSQD
QTIPSCPVNW RSLWIGYSFL MHTGSGDQGG GQSLMSPGSC LEDFRSAPFI ECQGQRGTCQ
FFATKYSFWL TTVLPELQFV SAPLSGTLKE GQEQRRRISR CQVCLKH
//