ID A0A091VJC8_NIPNI Unreviewed; 1311 AA.
AC A0A091VJC8;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 24-JAN-2024, entry version 20.
DE SubName: Full=Fanconi anemia group I protein {ECO:0000313|EMBL:KFR02603.1};
DE Flags: Fragment;
GN ORFNames=Y956_09739 {ECO:0000313|EMBL:KFR02603.1};
OS Nipponia nippon (Crested ibis) (Ibis nippon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Pelecaniformes; Threskiornithidae;
OC Nipponia.
OX NCBI_TaxID=128390 {ECO:0000313|EMBL:KFR02603.1, ECO:0000313|Proteomes:UP000053283};
RN [1] {ECO:0000313|EMBL:KFR02603.1, ECO:0000313|Proteomes:UP000053283}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_Y956 {ECO:0000313|EMBL:KFR02603.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL411013; KFR02603.1; -; Genomic_DNA.
DR STRING; 128390.A0A091VJC8; -.
DR eggNOG; KOG4553; Eukaryota.
DR Proteomes; UP000053283; Unassembled WGS sequence.
DR GO; GO:0006281; P:DNA repair; IEA:InterPro.
DR CDD; cd11720; FANCI; 1.
DR InterPro; IPR026171; FANCI.
DR InterPro; IPR029310; FANCI_HD1.
DR InterPro; IPR029312; FANCI_HD2.
DR InterPro; IPR029308; FANCI_S1.
DR InterPro; IPR029305; FANCI_S1-cap.
DR InterPro; IPR029315; FANCI_S2.
DR InterPro; IPR029313; FANCI_S3.
DR InterPro; IPR029314; FANCI_S4.
DR PANTHER; PTHR21818; BC025462 PROTEIN; 1.
DR PANTHER; PTHR21818:SF0; FANCONI ANEMIA GROUP I PROTEIN; 1.
DR Pfam; PF14679; FANCI_HD1; 1.
DR Pfam; PF14680; FANCI_HD2; 1.
DR Pfam; PF14675; FANCI_S1; 1.
DR Pfam; PF14674; FANCI_S1-cap; 1.
DR Pfam; PF14676; FANCI_S2; 1.
DR Pfam; PF14677; FANCI_S3; 1.
DR Pfam; PF14678; FANCI_S4; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000053283}.
FT DOMAIN 1..26
FT /note="FANCI solenoid 1 cap"
FT /evidence="ECO:0000259|Pfam:PF14674"
FT DOMAIN 37..254
FT /note="FANCI solenoid 1"
FT /evidence="ECO:0000259|Pfam:PF14675"
FT DOMAIN 259..345
FT /note="FANCI helical"
FT /evidence="ECO:0000259|Pfam:PF14679"
FT DOMAIN 353..516
FT /note="FANCI solenoid 2"
FT /evidence="ECO:0000259|Pfam:PF14676"
FT DOMAIN 530..762
FT /note="FANCI helical"
FT /evidence="ECO:0000259|Pfam:PF14680"
FT DOMAIN 780..1004
FT /note="FANCI solenoid 3"
FT /evidence="ECO:0000259|Pfam:PF14677"
FT DOMAIN 1017..1270
FT /note="FANCI solenoid 4"
FT /evidence="ECO:0000259|Pfam:PF14678"
FT REGION 1273..1311
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KFR02603.1"
FT NON_TER 1311
FT /evidence="ECO:0000313|EMBL:KFR02603.1"
SQ SEQUENCE 1311 AA; 146125 MW; 3C3F4C15D91610B7 CRC64;
QLGDMVTRQA LKGRETAALL RAIFKGSPCS QQSGVLRRLQ VYKHCVPLVE SGDLHLGKVS
EIIGLLMLEA RQLPGHALAE LAALFVDVIK GGSLSNGKSL ELFSTVLTAL ASSKESLAYG
KGELNGEEFK KQLINTLCSS KWDPQSVIHL ANMFRDIPLS GEELQFVVEK VLRMFSKLDL
QEIPPLVYQL LLLSAKGSKK TVLEGIISFF NQLDKRQKEE QRVPQSVDLE VATMPLDQLR
HVEGTVILHI VSVINLDQDL GEELIRHLKT EQQKDPGKAL CPFSVALLLS VAVKHRLQEQ
IFDFLKTSIT RSCKDLQFLQ ASKFLQDLFP QQYDVTAVIL EVVKNSAFGW DHVTQGLVDL
GFSLMESYEP KKPFGGKAAD TSYGLSKIPA QQACKLGANI LLETFKVHEP IRSDILEQVL
NRVLTKAASP VSHFIDLLSN IVVSAPLVLQ TSSSKVTETF DNLSFLPIDT VQGLLRAVQP
LLKVSMSVRD SLILVLQKAI FSRQLDARKA AVAGFLLLLR NFKVLGSLSS SQCSQAIGAT
QVQADVHACY NSAANEAFCL EILGSLRRCL SQQADVRLML YEGFYDVLRR NSQLASSIME
TLLSQIKQYY LPQQDLLPPL KLEGCIMAQG DQIFLQEPLA HLLCCIQHCL AWYKSTVQLC
QGAEDDDEEE DVGFQENFED MLESVTRRMI KSELEDFELD KSADFSLSSG VGVKNNIYAI
QVMGICEVLI EYNFNIGNFS KNKFEDILGL FTCYNKLSEI LKEKAGKNKS TLGNKTARSF
LSMGFVSTLL TALFRDNAQT HEESLAVLRS STEFLRYAVS VALQKVQQLE ETGQTDGPDG
QNPEKMFQNL CKITRVLLWR YTSIPTVVEE SGKKKGKSIS LLCLEGLLRI FNTVQQLYTA
RIPQFLQALD ITDGDAEETD INVTEKAAFQ IRQFQRSLVN QFSSAEDDFN SKETQSLITV
LSTLSKLLDP ASQQFLQFLT WTVKICKENA LEDIACCKGL LSLLFSLHVL YKSPVSLLRE
LAQDIHACLG DIDQDVEVES RSHFAIVNAK TAAPTVCLLV LGQADKVLEE VDWLIKKLTS
LGSDTSEDSS QASNQTQALE KGVILQLGTL LTVYHELVQT ALPAGSCVDT LLRSLSKTYA
ILTSLIKHYI QACRSTSNTI PGRLEKLVKL SGSHLTPQCY SFITYVQNIH SESLSFAEEK
KKKKKEDAAA AISTVMAKVL RETKPIPNLI FAIEQYEKFL IHLSKKSKVN LMQYMKLSTS
RDFRINASML DSALQEHNTE NAENEPDNDQ SSTAEQTDEN QEPEKKRQRK K
//