ID A0A151N2B5_ALLMI Unreviewed; 1935 AA.
AC A0A151N2B5;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 24-JAN-2024, entry version 22.
DE SubName: Full=Retinoic acid-induced protein 1 {ECO:0000313|EMBL:KYO30789.1};
GN Name=RAI1 {ECO:0000313|EMBL:KYO30789.1};
GN ORFNames=Y1Q_0008383 {ECO:0000313|EMBL:KYO30789.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO30789.1};
RN [1] {ECO:0000313|EMBL:KYO30789.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO30789.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO30789.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03004154; KYO30789.1; -; Genomic_DNA.
DR RefSeq; XP_014459798.1; XM_014604312.2.
DR RefSeq; XP_019350277.1; XM_019494732.1.
DR RefSeq; XP_019350278.1; XM_019494733.1.
DR RefSeq; XP_019350279.1; XM_019494734.1.
DR RefSeq; XP_019350280.1; XM_019494735.1.
DR RefSeq; XP_019350281.1; XM_019494736.1.
DR STRING; 8496.A0A151N2B5; -.
DR GeneID; 102573287; -.
DR KEGG; amj:102573287; -.
DR CTD; 10743; -.
DR eggNOG; ENOG502QSNS; Eukaryota.
DR OrthoDB; 2919570at2759; -.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR034732; EPHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR14955; RETINOIC ACID INDUCED 1/TRANSCRIPTION FACTOR 20; 1.
DR PANTHER; PTHR14955:SF6; RETINOIC ACID-INDUCED PROTEIN 1; 1.
DR Pfam; PF13771; zf-HC5HC2H; 1.
DR SMART; SM00249; PHD; 1.
DR SUPFAM; SSF81995; beta-sandwich domain of Sec23/24; 1.
DR PROSITE; PS51805; EPHD; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000050525};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 1809..1931
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS51805"
FT REGION 1..39
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 174..233
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 255..292
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 311..363
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 377..410
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 455..506
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 532..561
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 646..677
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 884..923
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1111..1134
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1214..1249
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1273..1292
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1297..1317
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1383..1403
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1483..1503
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1520..1612
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1663..1694
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1825..1846
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 174..227
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 328..363
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 393..408
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 472..506
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 646..667
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 893..923
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1529..1548
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1574..1604
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1935 AA; 211166 MW; 3F099601078773E1 CRC64;
MQSFRERCGF HGNQQNYQQT SQDTSRLENY RHQSQAGPNC ERQRLVAKEY YNQQQLPYPG
YENSAVEKYH RGNKQLPGQQ LQGRPSFSNY AVQENSPYPA RYSGDESLQA WGAQPQALAG
GVAKYEENLM KKTPAPPGSR QYHEQASQLP FRTHSLHLQQ PPALTYPKLP RQKIQNDVSS
PMPFSQTPHF TQHSQSFPAS STYSSVPGGS QTAHSYKSCT APSGQPQHER PLGNAANLAS
GQRVQNLHGY QANRMSYDQQ QQQQQQQQQQ QQQQQQQQQP QPQPQPSLQG RHHAQETLHY
QNLAKYQHYS QPGQSYCQGD APPVRTPEQY YQTFSPSSSH SPARSVGRSP SYSSTPSPLM
PNLENFQYSQ QSLSAGTFPA GITDHSHFMP LLNPSPTDGT SPEAQPGNCK SLQKEKIPEN
LLSDLSLQSL TALTSQVENI SNTVQQLLLS KSSMAQKKGI KNPPRTPEQL KGQHCSPESS
TYSAEQVGTP LSDPLSTPQS VHAETQDADY LSGSEDQLER SFLYCGQGRS PARVNSNSKA
KPESVSTCSV TSPDDMSTKS DDSFQSIHAN LPLDTFTKFV TNERECPRLL LSALSQEELA
SEIIVLQDAI NEKVDKAWAD SPSLSKEATK SPFHLENHRT CLDSMVKGTW PSQGDSSTLT
ESLKLDKASG GNNGKDFSEE VYENPSVEFA AAETKNALKD ASSLAYNSKP SIPAATSSSG
AASYSCYSNT TANSVGSEST MEHFDWPDES LSESCLRWKD LGSSLQSSDL SKGLFHSKLA
GSCKEKKNAC SMEMCDGEQP AKNEQAKDFS QQEMGEEEEE TLTYDEATKA DNERWLEDTR
HCCSGGDFSE IPMISSPDLK ESDLEPEEYS SLCELASSEQ KSMIYDTSPP KPPENATVLS
SSDVPVSAEE TVSTVEKENS APSSRLSGQS VILLGPAVGT ETKVKSWFES SLSHIKPEDE
AVGSERTLQG KAESEMPLSV KVKNQVTPEN LLIKSEPTLR AKSLRSKRVH CRLSEREDSG
KLVPSLIKDV PAAGVVGSTC VGPESQIETP SKNAHGQTPR FPAEGLPARM CTRSFTALTE
PRTPAPLEGL KASDHQEKLG KKSACAMKQR AAFKARKRSG KPAPKGVQNP SDLAPVLVPN
LVQNDDLVGQ KPKDLEAPET EVKDQRSMIL RSRTKTQEVF YTKRRRERRT VEVRLKNCKA
PKKLLSNNHL SPAFKLASQG SPHKEGKVGK RMKLPKPGAG MGSKMSERPL HSLKRKSTFL
SPIPAKKRNL VLRSNSSGAK DEKPDASPSL FKRMPSTKKA KAKLPTKSPC EAVLKPPPAK
ETPDVCIKIT SRAAFQGATK TKVLPPRKGR GLKLEAIVQK ITSPNLKKFA CKTAAAAAAA
ATARRNPLSP SAAEKERALK HGSVTPAVGE ARPLSQGMAQ KSPAASVAEQ LCRNSNNRSL
KGKLMNGKKL SSDCFRGEAC LSPEPAQHGG SMAAKSLGLL PKKRSRKGKA AALGLAKNPL
DKRAPLAPPL LLTARERAAA GNTLGNGEEG QRDGKKPKSE DKDFANGEGP EGRAAPGQAR
APKQRANHSN YNGYSKRQRK RLARGKAKNV ASRCKSRGKR RRQQQQAPLL HPTEPEIRLK
YISCKRLRTD SRALPFSPYV RVEKHNEFTT TCTVINSPGD EARLQKEQHR SSAAQALPPS
GAAPQPRAAL PSSSTMHLGP VVSKTLNATC LVCCVCRNPA NYKDLGDLCG PYYPEDCLPK
KKSRLKDKIK VEGPSDEAPL PLPAERVLKA TDNCPASAVV GKVPRLDSAA DSAKQSALRS
SSRGMFRKLQ SCYCCDERTE GEEAAEKPKR HECSKPESPA QEPAGDTQEH WVHEACAIWT
AGVFLVAGKL YGLQEAIKAA AEVKCSSCQQ TGATVGCCHK GCAQTYHYAC AIDTGCLLTE
ESFSLKCPKH KRHPL
//