PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID KHM99145.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family HD-ZIP
Protein Properties Length: 737aa    MW: 80753.9 Da    PI: 5.4461
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
KHM99145.1genomeTCUHKView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox696.1e-2250105156
                 TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
    Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                 ++k +++t++q++eLe++F+++++p++++r++L+k+lgL+ +qVk+WFqNrR+++k
  KHM99145.1  50 KKKYHRHTPQQIQELEAFFKECPHPDEKQRTDLSKRLGLENKQVKFWFQNRRTQMK 105
                 79999************************************************999 PP

2START174.66e-552624842205
                 HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEECT CS
       START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetleviss 87 
                 la+ a++el+k+a+ +  +W kss    e +n de+ + f++  +     + +ea r +gvv++ +   ve+l+d+  +W e++     +a+tlev+ss
  KHM99145.1 262 LALNAMNELIKMAQPDTSLWIKSSdgrnEVLNHDEYARLFSPYVGskpagYVTEATRGTGVVPASSLGIVETLMDVD-RWAEMFSsmiaSAATLEVLSS 359
                 6789********************9999999*********99666999*****************************.9****999999********** PP

                 T......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHH CS
       START  88 g......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlphw 179
                 g      galq+m ae q+lsplvp R   f+Ry++q+g+g+w++vdvSvd  +++++s++++ +++lpSg++i++++ng sk+twveh  ++++++h+
  KHM99145.1 360 GmgesrsGALQVMLAEVQLLSPLVPaRSLSFLRYSKQHGEGVWAVVDVSVDIGRNVTNSHPLMSCRRLPSGCVIQDMPNGFSKITWVEHSQYDESVVHQ 458
                 *************************************************************************************************** PP

                 HHHHHHHHHHHHHHHHHHHHTXXXXX CS
       START 180 llrslvksglaegaktwvatlqrqce 205
                 l+r+lv+sg+ +ga++w atl rqc 
  KHM99145.1 459 LYRPLVSSGIGFGAQRWIATLLRQCD 484
                 ***********************996 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466896.27E-2130107IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.602.5E-2335101IPR009057Homeodomain-like
PROSITE profilePS5007117.96147107IPR001356Homeobox domain
SMARTSM003892.5E-1949111IPR001356Homeobox domain
PfamPF000461.8E-1950105IPR001356Homeobox domain
CDDcd000864.12E-1854107No hitNo description
PROSITE patternPS00027082105IPR017970Homeobox, conserved site
PROSITE profilePS5084844.133252488IPR002913START domain
SuperFamilySSF559611.02E-33254485No hitNo description
CDDcd088755.37E-110256483No hitNo description
SMARTSM002343.3E-38261485IPR002913START domain
PfamPF018522.3E-46262484IPR002913START domain
SuperFamilySSF559612.61E-17504706No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 737 aa     Download sequence    Send to blast
MGQIGESFDT SNLLGRLRDD EYESRSGSDN FDGGSGDDQD AGDDQPHKKK KKYHRHTPQQ  60
IQELEAFFKE CPHPDEKQRT DLSKRLGLEN KQVKFWFQNR RTQMKTQLER HENMILRQEN  120
DKLRAENSVM KDALANPTCN NCGGPAIPGQ ISLEEHQTRM ENARLKDELN RICALANKFL  180
GRPLSPLASP MALPPSNSGL ELAIGRNGIG GPSNFGMSLP MGFDVGDGVM GSSPGMSSMG  240
ARSPMGMMGN EIQLERSMLL DLALNAMNEL IKMAQPDTSL WIKSSDGRNE VLNHDEYARL  300
FSPYVGSKPA GYVTEATRGT GVVPASSLGI VETLMDVDRW AEMFSSMIAS AATLEVLSSG  360
MGESRSGALQ VMLAEVQLLS PLVPARSLSF LRYSKQHGEG VWAVVDVSVD IGRNVTNSHP  420
LMSCRRLPSG CVIQDMPNGF SKITWVEHSQ YDESVVHQLY RPLVSSGIGF GAQRWIATLL  480
RQCDCLAILM SQIPSEDPTV ISLEGKKNML KLAQRMTEYF CSGICASSVR KWEILNIGNL  540
ADDMRIMARK INMDDPTEAP GIVLSASTSV WMPVSRQRVF DFLRDENLRG EWDMLSKDGP  600
MKEMLHIAKG QDRGNCVSIL HSANSECNVL YLQESWTDAS GSLVVYSPIN MQALNMVMNC  660
GDSSFVALRP SGFAILPDGA SNNGDGSDGG GSCLLTVGLQ MLPNGDQSTK FTMESVVTVN  720
SLISNTIQKV KDALGVA
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapKHM99145.1
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006587344.10.0homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform X1
RefseqXP_028180201.10.0homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X1
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLK7LDX20.0K7LDX2_SOYBN; Uncharacterized protein
STRINGGLYMA09G26600.20.0(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF119234106
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]