PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID KHN31523.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family HD-ZIP
Protein Properties Length: 742aa    MW: 81002 Da    PI: 5.5262
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
KHN31523.1genomeTCUHKView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox69.93.1e-2256111156
                 TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
    Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                 ++k +++t++q++eLe++F+++++p++++r++L+kklgL+ +qVk+WFqNrR+++k
  KHN31523.1  56 KKKYHRHTPQQIQELEAFFKECPHPDEKQRTDLSKKLGLENKQVKFWFQNRRTQMK 111
                 79999************************************************999 PP

2START177.77.1e-562684912205
                 HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEEC CS
       START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla....kaetlevis 86 
                 la++a++el+k+a+ +  +W kss    e +n de+ + f++  +      + +ea r +gvv + +  lve+l+d   qW+e++     +a+t+ev+s
  KHN31523.1 268 LALSAMNELIKMAQPDTSLWIKSSdgrnEVLNHDEYARLFSPYIGskpaagYVTEATRGTGVVSASSLGLVEILMDAD-QWSEMFSsmiaSAATVEVLS 365
                 6899********************9999999**********99999********************************.******************** PP

                 TT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXH CS
       START  87 sg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlph 178
                 sg      galq+m ae q+lsplvp R + f+R+++q+ +g w++vdvSvd  +++++s++++ +++lpSg++i++++ng s++twveh  ++++++h
  KHN31523.1 366 SGtggtrsGALQVMLAEVQLLSPLVPaRQVSFLRFCKQHAEGLWAVVDVSVDIGRNVTNSHPLMSCRRLPSGCVIQDMPNGFSNITWVEHSQYDESVIH 464
                 *************************************************************************************************** PP

                 HHHHHHHHHHHHHHHHHHHHHTXXXXX CS
       START 179 wllrslvksglaegaktwvatlqrqce 205
                 +l+r+lv+sg+ +ga++w atl rqc 
  KHN31523.1 465 QLYRPLVSSGIGFGAQRWIATLLRQCD 491
                 ************************996 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466895.43E-2136113IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.601.7E-2341107IPR009057Homeodomain-like
PROSITE profilePS5007117.99353113IPR001356Homeobox domain
SMARTSM003891.1E-1955117IPR001356Homeobox domain
PfamPF000469.0E-2056111IPR001356Homeobox domain
CDDcd000862.72E-1860113No hitNo description
PROSITE patternPS00027088111IPR017970Homeobox, conserved site
PROSITE profilePS5084843.471258495IPR002913START domain
SuperFamilySSF559618.79E-32260492No hitNo description
CDDcd088751.12E-108262490No hitNo description
SMARTSM002341.4E-39267492IPR002913START domain
PfamPF018522.9E-47268491IPR002913START domain
SuperFamilySSF559615.63E-17510714No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 742 aa     Download sequence    Send to blast
MEGHSEMGLM GESFDTSNLL GRMRDDEYES RSGSDNFDGG SGDDQDAGDD QPHKKKKKYH  60
RHTPQQIQEL EAFFKECPHP DEKQRTDLSK KLGLENKQVK FWFQNRRTQM KTQLERHENM  120
ILRQENDKLR AENSVMKDAL ANPICNNCGG PAIPGQISLE EHQTRMENAR LKDELNRICA  180
LANKFLGRPL SPLASPMALP PSNSGLELAI GRNGLGGSSN FGMPLPMGFD VGDGALGSSP  240
AMSTMGARSP MGMMGNEIQL ERSMLLDLAL SAMNELIKMA QPDTSLWIKS SDGRNEVLNH  300
DEYARLFSPY IGSKPAAGYV TEATRGTGVV SASSLGLVEI LMDADQWSEM FSSMIASAAT  360
VEVLSSGTGG TRSGALQVML AEVQLLSPLV PARQVSFLRF CKQHAEGLWA VVDVSVDIGR  420
NVTNSHPLMS CRRLPSGCVI QDMPNGFSNI TWVEHSQYDE SVIHQLYRPL VSSGIGFGAQ  480
RWIATLLRQC DCLAILRSPQ GPSEDPTAQA GRTNMMKLAQ RMTECFCSGI CASSACKWDI  540
LHIGNLADDM RIMARKIDDP TEAPGIVLSA STSVWMPVSR KRVFDFLRDE NLRGEWDLLS  600
KDGPMKEMLH IAKGQDRGNC VSILHSANSE CNVLYLQESW SDASGSMVVY SPINMQALQM  660
VMSCGDSSFV PLRPSGFAIL PDGTSNNGDG SDGGGSCLLT VGLQMLPNGN HQSAKFTMES  720
VDAVNNLISF TIQKVKDALG VA
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapKHN31523.1
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_028205953.10.0homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X2
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLA0A445GKQ00.0A0A445GKQ0_GLYSO; Homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform A
STRINGGLYMA16G32130.20.0(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF119234106
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Qi X, et al.
    Identification of a novel salt tolerance gene in wild soybean by whole-genome sequencing.
    Nat Commun, 2014. 5: p. 4340
    [PMID:25004933]