ID A0A1A6GN06_NEOLE Unreviewed; 1299 AA.
AC A0A1A6GN06;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 24-JAN-2024, entry version 24.
DE RecName: Full=DUF4599 domain-containing protein {ECO:0008006|Google:ProtNLM};
DE Flags: Fragment;
GN ORFNames=A6R68_03854 {ECO:0000313|EMBL:OBS67591.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS67591.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS67591.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS67591.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS67591.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004167}; Single-
CC pass membrane protein {ECO:0000256|ARBA:ARBA00004167}.
CC -!- SIMILARITY: Belongs to the SPATA31 family.
CC {ECO:0000256|ARBA:ARBA00035009}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS67591.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01078268; OBS67591.1; -; Genomic_DNA.
DR STRING; 56216.A0A1A6GN06; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR InterPro; IPR039509; SPATA31.
DR InterPro; IPR027970; SPATA31F3-like.
DR PANTHER; PTHR21859; ACROSOME-SPECIFIC PROTEIN; 1.
DR PANTHER; PTHR21859:SF15; PROTEIN FAM205A-RELATED; 1.
DR Pfam; PF15371; DUF4599; 1.
DR Pfam; PF14650; FAM75; 2.
PE 3: Inferred from homology;
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000092124};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 16..34
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 55..134
FT /note="SPATA31F3-like"
FT /evidence="ECO:0000259|Pfam:PF15371"
FT DOMAIN 404..438
FT /note="SPATA31"
FT /evidence="ECO:0000259|Pfam:PF14650"
FT DOMAIN 435..612
FT /note="SPATA31"
FT /evidence="ECO:0000259|Pfam:PF14650"
FT REGION 438..479
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 594..631
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 685..740
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 906..966
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 987..1246
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 594..615
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 707..721
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 939..966
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 997..1014
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1017..1059
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1060..1078
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1093..1108
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1148..1164
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:OBS67591.1"
SQ SEQUENCE 1299 AA; 145709 MW; 7A8E1D6A1D7BD715 CRC64;
VWTMLSSTYF LWDIDYPLYV YFYIFIIILI IWQVRQNYHR LKCEHKRSCC QRHRKVKQRA
RDAALRARRL SQEEAEKPWE LLDIMKSQSW LPQEEHVRQL LCADPCCQVC DAASLEIQQL
LQSEKSQTSP VLLELPSGSC LEMLPISSVS FEQSMELHSE HSRHLALASG TPTLAQLTEH
LTQSTSAVGV QQCYADFLQV GQEFHRTDMP MVSETVSSSR LRESVVLVTA GETXHSNFNY
VQRIQDHQSF NSQISLQTLN PESTHVIHPM TLSLVTTVPQ PFLSPDVLRL LELHVKKLVH
FQKWGLPRHV EESLRQLMPN QPMYFQPENN QPVSLILSDT SQDYVDRFGN ISHQTLCSYT
DSQPTQTFWV SEWSNVDLEQ KTQTPDHNVL RGICLLPEGQ ANNSRSNLQK KFTQLFCGLP
SRHSESLVNT FLGTQDLSKD TPKPSHKELH LLKNSAPIPL LPHTPPKATP PTSSTSPNES
LYEHQEAQLS VPFLIPAECK TLERRLLQRQ LRWGLRGVIP RPPHVQSHIQ YKPCNKARSH
ETLKASFPGK SFSVLTRDLF FVPDHARRLL EFHLQKHLIH SSWGLPQRTQ HSMNLLLSST
DQQSRSCSSR VPPNVSIPQP GDPEDDGSDD TFALAVDKGS IPTPHLFSQT KSMLKSHVDS
KYDQIHQGNV PACVQSSWEC RTPGSSAAAA PFPKIPPDQP LELQAENNPD LHHKVVPPEP
EALDQEKQAS SGAFIEHCKR PQTLPEETIK KLETTLRHKY LAFLSGMPAL YCVIPSRTIS
PVIVSQSATR EMLPGPVKIP QEPLTQMISL EDPCRSGLDP CTQDDKEAST DITEEVQSEV
QVEGRTENVS LENQKEILEK LNFHLKKKIL EIRLGIPMTA DELIEGNAAG PDSESMQEFV
GSLDIPEGTA LQKLPSSGHS XPAPDANRVH LQKQPATAEQ AVCHKQRQPS SKAVPHRSVQ
WGSKASKLRS TIEAQVYCVQ METSGEKPXL EEPFSTEPQS PGKSKXSAQV PTLTEKSEEP
GEPKAVRDLG ERDADHRLSP TSEKTHHDGD QELEERPLHK TPQGSSQQRH SFHLEDPCQP
SPQNPPELEF PDPHPEVFIE REPGHDIQDS QTKVNVIPIA KVPQPVASQA SWGQPFPQPP
IQGKPYGGQT WQDHSSWGQV RPTSPHASPS PPEAGLKTKM KSFLHAINPK IKGKTHVEPM
VSTPGKAAKT SKENVDKGLP QAKSPTKKTK TENXRGPKAQ SASSEKSVIT SLLTVPYILD
SKLWPRPRQR ASVSATGRPR HCPRHCPQLA YAIQYRNPP
//