PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PK00670.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Cannabaceae; Cannabis
Family HD-ZIP
Protein Properties Length: 761 aa    MW: 83278.8 Da    pI: 5.7565
Description HD-ZIP family protein
Gene Model
Gene Model ID  Type    Source
PK00670.1      genome  CCBR
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  65.9   5.6e-21  54     109  1          56
                TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
   Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                ++k +++t++q++eLe++F+++++p++++r eL+++l+L+++qVk+WFqNrR+++k
  PK00670.1  54 KKKYHRHTPHQIQELESFFKECPHPDEKQRLELSRRLSLETKQVKFWFQNRRTQMK 109
                79999************************************************999 PP

2    START     187.8  5.7e-59  264    487  2          205
                HHHHHHHHHHHHHHHC-TT-EEEE.....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEECT CS
      START   2 laeeaaqelvkkalaeepgWvkss.....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetleviss 87 
                la +a++el+k+a+ae p+W kss     e++n +e++++f++  +     + +ea r+s vv+ ++  lve+l+d + +W e+++    +a+t+e is 
  PK00670.1 264 LAMAAMDELLKLAQAEGPMWIKSSdgggkEMLNHEEYMRTFPPCIGakpngYVSEATRDSSVVIINSLALVETLMDAN-RWIEMFPclisRASTIEMISA 362
                57799**************************************99999******************************.********************* PP

                T......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHHH CS
      START  88 g......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlphwl 180
                g      gal +m  elq+lsplvp R   f+R+++q+g+g+w++vdvS+d +++  +  s+  +++lpSg+l++++++g+skvtwveh++++++ +h++
  PK00670.1 363 GmggtrnGALGVMHVELQVLSPLVPlRPLKFIRFCKQHGDGVWAVVDVSIDINREALNAESYFQCRRLPSGCLVQDMPDGYSKVTWVEHTEYDASGVHQM 462
                *******************************************************988****************************************** PP

                HHHHHHHHHHHHHHHHHHHTXXXXX CS
      START 181 lrslvksglaegaktwvatlqrqce 205
                +rs ++sgl +ga++w+atlqrqce
  PK00670.1 463 FRSYISSGLGFGAQRWLATLQRQCE 487
                ************************8 PP

Protein Features
3D Structure
Database         Entry ID          E-value    Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60  9.3E-22    40     111  IPR009057    Homeodomain-like
SuperFamily      SSF46689          1.04E-20   40     111  IPR009057    Homeodomain-like
PROSITE profile  PS50071           17.524     51     111  IPR001356    Homeobox domain
SMART            SM00389           1.8E-18    53     115  IPR001356    Homeobox domain
CDD              cd00086           1.43E-19   54     111  No hit       No description
Pfam             PF00046           1.3E-18    54     109  IPR001356    Homeobox domain
PROSITE pattern  PS00027           0          86     109  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848           43.716     254    491  IPR002913    START domain
SuperFamily      SSF55961          8.79E-33   255    488  No hit       No description
CDD              cd08875           9.37E-118  258    487  No hit       No description
SMART            SM00234           2.5E-47    263    488  IPR002913    START domain
Pfam             PF01852           1.2E-50    264    487  IPR002913    START domain
SuperFamily      SSF55961          3.54E-20   516    753  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 761 aa
MDTHGEMGLL GENFDLGMIG RIRDDGYESR SGSDNLEGAS GDDQDAGDDQ PPRKKKYHRH  60
TPHQIQELES FFKECPHPDE KQRLELSRRL SLETKQVKFW FQNRRTQMKT QLERHENIIL  120
RQENDKLRAE NNMIKDAMSN PMCNQCGGPA IPGQISFEEH QLRIENARLK DELSRICSLA  180
NKFLGRPLSS LVAPLPLPSS ASLELAMGRN GMGGLNVGPP LPMGLDLGDG VSSSAHMMPL  240
VKSSMGMSAF GNEIPFDRSM FIDLAMAAMD ELLKLAQAEG PMWIKSSDGG GKEMLNHEEY  300
MRTFPPCIGA KPNGYVSEAT RDSSVVIINS LALVETLMDA NRWIEMFPCL ISRASTIEMI  360
SAGMGGTRNG ALGVMHVELQ VLSPLVPLRP LKFIRFCKQH GDGVWAVVDV SIDINREALN  420
AESYFQCRRL PSGCLVQDMP DGYSKVTWVE HTEYDASGVH QMFRSYISSG LGFGAQRWLA  480
TLQRQCEYLA TLMSPSIPNE DQIAVSLGGR RSLLKLAKRM VDNFCAGVCA STVRKWDKLR  540
VDNVGEDVRV MTRKSMDDPG EPPGIILSAA TSVWIPVSQK RVFEFLQDER LRSEWDILSN  600
GGPMQEMVHI TKGQRHGNHV SLLRAGSMGA NDSSTMLILQ ETWTDSSGSM VVYAPVDSAA  660
INVVMHGGDS AYVALLPSGF AIVPDGSSQS GYLSGNNEST SGASVKGEGD DSGGSILTVG  720
FQILVNSLPS AKLTVESVET VNNLISCTIQ KIKSALSGPS A
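
The sequence is printed in blocks of ten residues with a running position counter at the end of each line, and the domain tables use 1-based inclusive coordinates. A minimal sketch, assuming only the first two lines of the listing are pasted in, shows how the blocks can be joined and how the Homeobox span (54 to 109) maps onto a Python slice. The helper names parse_blocked_sequence and slice_domain are hypothetical and are not part of any PlantTFDB tool.

```python
import re


def parse_blocked_sequence(text):
    """Join the letter blocks of a blocked protein listing into one string.

    Only runs of letters are kept, so spaces, newlines and the trailing
    position counters (60, 120, ...) are dropped.
    """
    return "".join(re.findall(r"[A-Za-z]+", text)).upper()


def slice_domain(sequence, start, end):
    """Return residues start..end, using 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


# First two lines of the listing above (residues 1-120), copied verbatim.
FIRST_TWO_LINES = """
MDTHGEMGLL GENFDLGMIG RIRDDGYESR SGSDNLEGAS GDDQDAGDDQ PPRKKKYHRH  60
TPHQIQELES FFKECPHPDE KQRLELSRRL SLETKQVKFW FQNRRTQMKT QLERHENIIL  120
"""

prefix = parse_blocked_sequence(FIRST_TWO_LINES)
homeobox = slice_domain(prefix, 54, 109)
print(len(prefix), len(homeobox))
print(homeobox)
```

The extracted span has 56 residues and reads the same as the PK00670.1 row of the Homeobox alignment. Pasting all thirteen lines of the listing into the same function should give the full 761-residue sequence.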
Functional Description
Source   Description
UniProt  Probable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_024028635.1  0.0      homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform X2
Swissprot  Q0WV12          0.0      ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBL     A0A2P5CZI4      0.0      A0A2P5CZI4_TREOI; Octamer-binding transcription factor
STRING     XP_010107411.1  0.0      (Morus notabilis)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Fabids   OGEF1192              34           106
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G61150.1  0.0      homeodomain GLABROUS 1
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]