ID A0A087VE35_BALRE Unreviewed; 1360 AA.
AC A0A087VE35;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 27-MAR-2024, entry version 29.
DE SubName: Full=Collagen alpha-1(IV) chain {ECO:0000313|EMBL:KFO10877.1};
DE Flags: Fragment;
GN ORFNames=N312_01097 {ECO:0000313|EMBL:KFO10877.1};
OS Balearica regulorum gibbericeps (East African grey crowned-crane).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Gruiformes; Gruidae; Balearica.
OX NCBI_TaxID=100784 {ECO:0000313|EMBL:KFO10877.1, ECO:0000313|Proteomes:UP000053309};
RN [1] {ECO:0000313|EMBL:KFO10877.1, ECO:0000313|Proteomes:UP000053309}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_N312 {ECO:0000313|EMBL:KFO10877.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL489826; KFO10877.1; -; Genomic_DNA.
DR Proteomes; UP000053309; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023:SF1019; COLLAGEN; 1.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 15.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:KFO10877.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000053309};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1136..1360
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 1..119
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 167..1133
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 55..76
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 98..119
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 169..184
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 336..352
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 380..420
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 474..512
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 524..562
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 782..832
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 907..948
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1049..1077
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KFO10877.1"
FT NON_TER 1360
FT /evidence="ECO:0000313|EMBL:KFO10877.1"
SQ SEQUENCE 1360 AA; 130453 MW; 677EBF7171C40AE5 CRC64;
GLPGGPGING APGKEGEKVV SRPGIDATVG PKGSKGLPGL PGAKGERGFS GRPGPPGLPG
SPGITTVGPP GPPGLPGERG QKGDPGLPGV SIPGQPGLDG PRGPPGPPGP PGPPAPSVPP
VSFPSRVECR YFLFLNLMFK YAFQMQIFVG DQGDTCFNCI GTGITGPPGE RGPPGPPGSP
GSPGFPGPKG EKGLPGLTGL VGPPGFPGSP GVPGRPGPKG DPGDVLTSPR MKGDKGDPGF
PGPPGLPGID GTPGRDGLPG LPGPKGEPGS VAFKGEMGIP GDPGIPGLPG DKGLPGPPGF
GPQGLPGEKG IQGVSGRPGP PGVPGPKGDP GQTITEPGVP GPPGPPGRNG DPGLPGDPGQ
PGQRGLSGIP GAKGEPGIPG IGLPGPPGPK GFPGTPGPPG APGTPGRPGL DGPPGPPGFP
GQKGDRGFGV PGPPGPPGPP GMKGVQGPKG DPGFPGNPGL PGRAGFDGTP GPKGDPGPSG
PPGLPGPPGI PGIGGQGPPG SPGPPGPVGP PGLQGIPGEK GDPGPPGFDV PGPPGDQGAP
GYPGPPGLPG PQGSPGPPGR DGIPGFPGAK GEMGVMGAPG PPGPPGTPGR NGLPGLKGNN
GLPGPPGPPG PVGQKGIKGE AGLPGPPGKV DSKQLGAKGE KGEPGVPGIP GLSGQKGYQG
LPGDPGPPGL SGPPGVPGLP GIKGDTGLPG QPGPTGPPGL KGAIGEMGLP GPPGIKGSQG
IAGRPGQPGP AGFPGLKGEK GDPGLSSIGI PGLPGPKGDL GLPGYPGSPG SKGIAGSPGL
PGFPGSPGPK GEPGLPGFPG TPGVPGPKGI EGPPGNPGLP GPPGPAGDIG RPGPPGPSGE
KGQPGRDGIP GPAGQKGEPG LPGFGRPGPP GLPGLSGQKG ELGLPGPPGP PGLPGLKGEP
GFQGFPGLQG PPGPPGLPGP PLEGPKGSPG PPGVPGRPGP PGPEGPRGPP GSGGLKGEKG
NPGPPGPPGL TGQKGDQGPP GHQGDPGHPG LNGMKGDPGV PGVPGFPGMK GPTGPAGPAG
LTGSQGLPGP PGPPGLPGTI GRSIVVKGDP GPPGPPGQPG SKGPPGLPGP QGLPGPIGLP
GDPGRDGLPG FDGPAGRKGE RGLPGQPGSR GTQGPPGPDG LQGPPGPPGT ASVAHGFLIT
RHSQTRDTPL CPQGTSRIYD GFSLLYVQGN ERAHGQDLGT AGSCLRRFST MPFMFCNINN
VCNFASRNDY SYWLSTPEPM PMSMEPLTGQ SIQPFISRCV VCEAPAMVIA VHSQTIQIPS
CPPGWDSLWI GYSFMMHTSA GAEGSGQALA SPGSCLEEFR SAPFIECHGR GTCNYYANSY
SFWLATVEVS EMFSKPQSET LKAGDLRTRI SRCQVCMKKT
//