PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Aradu.D4VWB
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Dalbergieae; Arachis
Family HD-ZIP
Protein Properties Length: 768aa    MW: 83964.3 Da    PI: 5.9787
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Aradu.D4VWBgenomeNCGR_PGCView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox66.43.9e-2187142156
                  TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
     Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                  ++k +++t++q++eLe++F+ +++p++++r++L+++lgL+ +qVk+WFqNrR+++k
  Aradu.D4VWB  87 KKKYHRHTPHQTQELEAFFKDCNHPDEKQRADLSRRLGLENKQVKFWFQNRRTQMK 142
                  79999************************************************999 PP

2START1488.5e-472965102206
                  HHHHHHHHHHHHHHHC-TT-EEEE...EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEECT CS
        START   2 laeeaaqelvkkalaeepgWvkss...esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetleviss 87 
                  la+ a++el+k+a+a++p+W k+    e++n++e+ +  +   +     + +ea+r++gvv+ ++  lve+++d   +W+e+++    +a tl+vi s
  Aradu.D4VWB 296 LALVAMDELIKMAQADSPLWIKMDggkEILNQEEYARMSSFI-SptspgYVTEASRETGVVIINSSALVETMMDPE-RWSEMFPsmvsRAVTLDVIAS 391
                  67889*********************9999999987655322.1356789****************9999988888.********************* PP

                  T......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEE..EEEEE-EE--SSX CS
        START  88 g......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghsk..vtwvehvdlkgrl 176
                  g      g lq+m+ae q+lsplvp R   f+R+++q+ +g w++vd S+d  ++           +lpSg+l+++++ng sk  +twveh  ++++l
  Aradu.D4VWB 392 GmggsrnGSLQVMQAEIQLLSPLVPvRQLSFLRFCKQHAEGLWAVVDASIDAGRNFK---------RLPSGFLVQDTPNGFSKasITWVEHSQYDESL 480
                  ******************************************************975.........7**************65449************ PP

                  XHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
        START 177 phwllrslvksglaegaktwvatlqrqcek 206
                  +h+l+r+l+ sgl +ga +w atlqrqce+
  Aradu.D4VWB 481 IHQLYRPLIASGLGFGAPRWIATLQRQCEC 510
                  ****************************96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.601.7E-2173144IPR009057Homeodomain-like
SuperFamilySSF466897.52E-2074144IPR009057Homeodomain-like
PROSITE profilePS5007117.28184144IPR001356Homeobox domain
SMARTSM003897.7E-1986148IPR001356Homeobox domain
CDDcd000862.02E-1887144No hitNo description
PfamPF000461.1E-1887142IPR001356Homeobox domain
PROSITE patternPS000270119142IPR017970Homeobox, conserved site
PROSITE profilePS5084838.154286513IPR002913START domain
SuperFamilySSF559616.04E-28289510No hitNo description
CDDcd088759.40E-107290509No hitNo description
SMARTSM002342.3E-32295510IPR002913START domain
PfamPF018522.8E-39296510IPR002913START domain
SuperFamilySSF559614.62E-15529752No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 768 aa     Download sequence    Send to blast
MPGFSAFELP LSSPLISMSK QNFPTLSFSN REMMEGHSEL GLIGDHFDGG LLGRIKDDGY  60
ESRSGSDNFE GASGDDQEAA NDQPKRKKKY HRHTPHQTQE LEAFFKDCNH PDEKQRADLS  120
RRLGLENKQV KFWFQNRRTQ MKTQIERHEN MILKQENERL RAENSVMREA MANPMCNNCG  180
GAAIPGQISF DEHQIRIENA RLKDELNRIC ALANKFLGRP ISSLASPMSL PASNSGLELG  240
IGRNGFSGGS SGLGMSMGLD FGDFSGGTLP AISGIRSPMG LMGNEIHAER SMLLELALVA  300
MDELIKMAQA DSPLWIKMDG GKEILNQEEY ARMSSFISPT SPGYVTEASR ETGVVIINSS  360
ALVETMMDPE RWSEMFPSMV SRAVTLDVIA SGMGGSRNGS LQVMQAEIQL LSPLVPVRQL  420
SFLRFCKQHA EGLWAVVDAS IDAGRNFKRL PSGFLVQDTP NGFSKASITW VEHSQYDESL  480
IHQLYRPLIA SGLGFGAPRW IATLQRQCEC LAIHMSSSIP SEDATAISPA GRRSMLKLAQ  540
RMANNFFSGI CLSSSQKWDT HHLGNVGDEI KVMTRKNTDV PGEPPSIVLC AATSVWMPVS  600
RQRVFDFLRD ERMRGEWVIL SGIGQMKEML RISKGQVDGN SLSVLRANAN NGGESSNNTL  660
FLQETWNDTS CSALLYSPVE VESLKVVMSG GDSTYVALIP SGFTIHPDGH SGTQDGDDGG  720
SSGCLLTVGL QILLNSLPTA KLTAESVETV NDLITGTIQK IKASLGVA
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapAradu.D4VWB
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_020999150.10.0homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X2
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLA0A445CZE70.0A0A445CZE7_ARAHY; Uncharacterized protein
STRINGGLYMA10G38280.20.0(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF119234106
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]