ID A0A1R3J190_9ROSI Unreviewed; 953 AA.
AC A0A1R3J190;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 13-SEP-2023, entry version 16.
DE RecName: Full=DUF4378 domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=COLO4_20185 {ECO:0000313|EMBL:OMO88595.1};
OS Corchorus olitorius.
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus.
OX NCBI_TaxID=93759 {ECO:0000313|EMBL:OMO88595.1, ECO:0000313|Proteomes:UP000187203};
RN [1] {ECO:0000313|Proteomes:UP000187203}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. O-4 {ECO:0000313|Proteomes:UP000187203};
RA Alam M., Haque M.S., Islam M.S., Emdad E.M., Islam M.M., Ahmed B.,
RA Halim A., Hossen Q.M.M., Hossain M.Z., Ahmed R., Khan M.M., Islam R.,
RA Rashid M.M., Khan S.A., Rahman M.S., Alam M., Yahiya A.S., Khan M.S.,
RA Azam M.S., Haque T., Lashkar M.Z.H., Akhand A.I., Morshed G., Roy S.,
RA Uddin K.S., Rabeya T., Hossain A.S., Chowdhury A., Snigdha A.R.,
RA Mortoza M.S., Matin S.A., Hoque S.M.E., Islam M.K., Roy D.K., Haider R.,
RA Moosa M.M., Elias S.M., Hasan A.M., Jahan S., Shafiuddin M., Mahmood N.,
RA Shommy N.S.;
RT "Corchorus olitorius genome sequencing.";
RL Submitted (SEP-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OMO88595.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AWUE01017043; OMO88595.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1R3J190; -.
DR STRING; 93759.A0A1R3J190; -.
DR OrthoDB; 543602at2759; -.
DR Proteomes; UP000187203; Unassembled WGS sequence.
DR InterPro; IPR022212; DUF3741.
DR InterPro; IPR025486; DUF4378.
DR PANTHER; PTHR47212; ADHESIN-LIKE PROTEIN, PUTATIVE (DUF3741)-RELATED; 1.
DR PANTHER; PTHR47212:SF4; ADHESIN-LIKE PROTEIN, PUTATIVE (DUF3741)-RELATED; 1.
DR Pfam; PF12552; DUF3741; 1.
DR Pfam; PF14309; DUF4378; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000187203}.
FT DOMAIN 224..268
FT /note="DUF3741"
FT /evidence="ECO:0000259|Pfam:PF12552"
FT DOMAIN 787..934
FT /note="DUF4378"
FT /evidence="ECO:0000259|Pfam:PF14309"
FT REGION 101..146
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 313..366
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 396..429
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 562..611
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 687..712
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 322..336
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 396..424
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 575..589
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 695..712
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 953 AA; 106947 MW; 54A9C3F5452741BB CRC64;
MAKRSNRRPV RYEKDQLGCM WGLISMFDFR HGRTTQRLLS DRRRGNRNAV GVGISGNKLA
MLPSSGENSP GTLDNEEKKA AIDACKPSVK RLLEEEMCGE QTAKKHEKNS QVEVKLCDSG
QGESQRKSRK RKSKTRKRSC GGSSIDMDAS EDLVLEGSCQ HKPALQTTSS VDIDNLMEEF
YQQINQKRIN CVNHDQPAEE HMLPNQKSSG FEERLSEAIK FLVSQKLING NQITEDGEVQ
ASKEVMDALQ ILSLDEELFL KLLRDPNSLL VKYVQDMPDA QTKKEEESKA LAGSNISEQD
LVALRQSNEP VNRKQRNFFR RKLKSQERDL SDGDKDSQAS NKIVILKPGP PSLQTPENGS
SLDSSAESQY IIRQRVENEK VGSHFFLSEI KRKLKHAMGR EQHRNPTDGT SKRFPGERKS
SGDSGGVKEY IGMNSPTKDH FFIERIARPS QGVKKGEKTI KLRGSEHGTE SETTDFSRQR
VSNIYIEARR HLSEMLTNGD ENVDLSRRPN PKTLGRILSL PEYNSSPVGS PGRNSEAGFV
TAQMRFAGSD NQHNHVSNLS HVSETTESEL CVSDDKTGNE VQGNDAISNK SDTTDDKTSN
EVQDDDAISN NLDTCVNDDK EDQIFGSTRD EMSSEGAVNV DKVTEIMVEE ESKIISSFSE
TSDSSITRDD KNVDTCDITD EKHYTEDLKQ DSCEEEQQPI SPLASPSNSS VNKKVECLES
ATDIQERPSP VSVLEPFADD VISPASIRSH SETSIQPLRI RFEEHDSSAT NQTNHVKTCM
DDKESILEYI KAVLQASSFN WDELYIRSLS SDQLLDPLLL GGVEYLPNQL CQDENLLFDC
INEVLMEVCG QYYGFPGVSF LKPNIRPLPN MKNTIEEVWQ GVHWHLLPMP VPRTLDQIVR
KDLAMTGTWM DLRLDTDCIG VEMGEAILED LVEDTITSYI DESLECEYHP LPA
//