ID A0A151X8W6_9HYME Unreviewed; 1905 AA.
AC A0A151X8W6;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Collagen alpha-1(IV) chain {ECO:0000313|EMBL:KYQ56760.1};
GN ORFNames=ALC60_04359 {ECO:0000313|EMBL:KYQ56760.1};
OS Trachymyrmex zeteki.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea;
OC Formicidae; Myrmicinae; Trachymyrmex.
OX NCBI_TaxID=64791 {ECO:0000313|EMBL:KYQ56760.1, ECO:0000313|Proteomes:UP000075809};
RN [1] {ECO:0000313|EMBL:KYQ56760.1, ECO:0000313|Proteomes:UP000075809}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Tzet28-1 {ECO:0000313|EMBL:KYQ56760.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:KYQ56760.1};
RA Nygaard S., Hu H., Boomsma J., Zhang G.;
RT "Trachymyrmex zeteki WGS genome.";
RL Submitted (SEP-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ982409; KYQ56760.1; -; Genomic_DNA.
DR STRING; 64791.A0A151X8W6; -.
DR Proteomes; UP000075809; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 14.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:KYQ56760.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000075809};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 1600..1823
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 60..108
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 237..438
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 452..611
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 715..768
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 930..977
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1029..1055
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1178..1225
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1412..1599
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1839..1905
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 249..280
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 316..331
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1844..1864
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1877..1892
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1905 AA; 193334 MW; D1822C77072E92C5 CRC64;
MIPERRSLRA CSVPECVVDT SKTFIYKANV WKHDSVFVLF HICKDSYKCN CKGIKGQPGF
PGVPGPQGTE GHPGDIGPDG PPGPKGEKGA AGEYGATGEK GYRGNPGVSG VMGPDGLDGC
NGTDGRAGAP GIPGVFGPRG VPGPSGDYGF PGEPGDGGIN SVGAKGIRGN PGIDGLEGYR
GQQGLTGEMG YPGDTGDFVS TVTPVLENEE RDKLDNPTLD GFRSFLRNCS HEISFADPKY
SPAMFNGVPG KEEEKKWKES ERISRESRFP PRDKHRSKES SYGLSGSPGL PGMQGERGDT
VYGVKGQLGE QGDRGEPGSP GKDEYKKKEN SLPVKGPLGA KGMKGFYGDR GRKGELGAKG
PQGRPGFLGI KGIKGEQGDD GPRGKQGVGG PPGPAGEKGE KGAPGFAGMP GPDGKEGEPG
EEGNRGPPGR QGAEGQPGQY IPELDEIIAG NIGIQGDLGP PGESGEPGTP GLPGKVGFIG
PPGPPGLPGN PGFAGAKGSS IKGEQGDDGL PGPVGPQGSS GLPGLPGIMG AKGFPGVTIT
GPPGPDGNPG LPGLSGPPGD RGDPGPPGEK GFPGKGIRIR GPPGERGLPG TPGISGPDGW
PGFTGPKGGK GIRGDDCGVC APGLPGEKGQ RGESGLDGYP GTTGFSGLPG PRGLKGVPGK
RGLSGPAGLE GDSGRSGIAG IQGPKGEQGK IFFPPGEIIV SPPGDFGEKG LPGPEGIRGL
RGQPGPIGDT GVPGPKGDKG EHGLPGLSGK DGLPGWDGIP GEKGEDASIP IEFLRGDPGY
SGEEGIKGIQ GDKGNKGEPG VSAEYIADSQ GDKGYKGARG APGENGFKGF EGRPGDEGLL
GLPGEPGSPG ISLQGPIGLK GYPGMPGDQG LRGTPGSPGL QGSVGFPGRI GNKGERGEPG
SNLTYGEIGE RGWYGPKGDF GDAGPQGLRG RSGEDGVKGI KGGKGAPGYE GLSGLEGAKG
QRGDSIQGDR GFPGQPGMPG MSGRIGDVGI RGPDGPSGFD GMKGLKGEIG FIGRRGDDGD
MGSQGYQGMH GTQGLPGAPG EKGDVGESGD SGSRGWPGFE GIDGAPGSKG EMGDAGIPGF
PGVPGQPGFK GIAGDTGLEG LSGIVGENSF SGPQGDVGEH GFMGEVGPPG LAGFDGRPGL
PGEPGLATDG FDGLQGQKGE SGFDGYNGRE GPKGEIGDMG PDGLKGEQGE QGFPGRSGAH
GIPGRHGAKG ERGPIGPPGF DGLKGKPGLD GDPGIMGRPD KMLHHLGLMG APGPKGYIGE
PGLPSYAFAE KGDFGNRGHD GFSGVKGQSG EPGYIGVPGR KGQIGDRGYP GPDGLLGLPG
FPGLPGDQGP RGAPGLLGEF AEPGEPGRAG LDGMPGIPGR AGQKGAPGEY GPNGPKGIFG
DIGHSLRGPK GMVGEPGLRG FVGFPGAPGL EGLQGFPGAP GPKGLRGEPG ISADSAKGQP
GEPGWRGLDG RTGPKGMRGD SGLSGIDELP GQKGERGETG EPGLPGLPGP EGPKGSKGES
GINPFPPFPR RGLPGDIGVP GLPGFPGLQG MIGEPGKDGL PGRQGETGMI GIPGLQGPDG
KDGPPGYQGP PGSTGRQGQP GFPGAPGLPA PNGIGPRDRG FYFARHSQSE MIPVCPRNTV
KIWEGFSLLH IMGNGRAYAQ DLGAAGSCIR KFSTMPFLFC NLNNVCDYAN RNDYSYWLST
TEPMPMMMTP IPAPEVGRYL SRCSVCEAPT RVIAVHSQSM TIPECPGGWE EIWVGYSFLM
HRDAGADGGG QSLVSPGSCL EEFRVRPFIE CRGLGSCNYF ATATSYWLAT IHDDEMFRKP
LQQTLKADHT SRVSRCAVCL RRRVTEVLKP FTSNNERAPN NWGVAAPAPP PPPRNLPPDY
ESSRIPPNPW ATRGRYSNRR VPHHQRRRGR VRGRHWESTE SFNNV
//