PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG030733t3
Common Name TCM_030733
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family GRAS
Protein Properties Length: 361aa    MW: 39618.5 Da    PI: 4.9708
Description GRAS family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG030733t3  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GRAS    214    8.1e-66  169    360  1          194
              GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkf 93 
                       l+elL +cA+a++++d+++a++l+++l++++s +g+p+qRl ay++e+L arla+s+s++ykal+++e +   s+e l+++++++e++P++kf
  Thecc1EG030733t3 169 LKELLCACAKAIDNNDMHMADWLMTQLRQMVSVSGEPIQRLGAYMLEGLVARLASSGSSIYKALRCKEPA---STELLSYMHILYEICPYFKF 258
                       5799*****************************************************************9...9******************* PP

              GRAS  94 shltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvak 184
                       +++ aN aI ea++ e+rvHiiDf+i qG+QW++L++aLa+Rp+gpps+RiTg+++p+s   +   le +g+rL k+Ae+++vpfef+  +a 
  Thecc1EG030733t3 259 GYMSANGAIAEAMKDESRVHIIDFQIAQGAQWLTLIHALAARPGGPPSIRITGIDDPTSAyaRGGGLEIVGQRLLKLAESCKVPFEFHS-AAI 350
                       **********************************************************9999999***********************9.777 PP

              GRAS 185 rledleleeL 194
                       + ++++le+L
  Thecc1EG030733t3 351 SGTEVQLENL 360
                       8888888877 PP
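
The middle line of each alignment block above can be tallied column by column. The sketch below assumes the usual HMMER convention for that line (a letter marks a residue identical to the profile consensus, `+` marks a positively scoring substitution, a space marks anything else); the function name `tally_match_line` is hypothetical and is applied only to the final short block (HMM 185-194, query 351-360).

```python
def tally_match_line(consensus: str, match: str, query: str) -> dict:
    """Count column types in one alignment block.

    Assumes the HMMER-style middle line: letter = identical to consensus,
    '+' = positively scoring substitution, anything else = mismatch or gap.
    """
    if not (len(consensus) == len(match) == len(query)):
        raise ValueError("all three alignment rows must have the same width")
    counts = {"identical": 0, "positive": 0, "other": 0}
    for symbol in match:
        if symbol.isalpha():
            counts["identical"] += 1
        elif symbol == "+":
            counts["positive"] += 1
        else:
            counts["other"] += 1
    return counts


# Last alignment block of the record, copied without its coordinates.
counts = tally_match_line("rledleleeL", "+ ++++le+L", "SGTEVQLENL")
print(counts)
```

Applying the same function to the two longer blocks requires keeping the leading whitespace of the middle line aligned with the consensus row, since a space in that line is itself a meaningful column.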

Protein Features
Database         Entry ID  E-value  Start  End  InterPro ID  Description
PROSITE profile  PS50985   39.598   143    361  IPR005202    Transcription factor GRAS
Pfam             PF03514   2.8E-63  169    360  IPR005202    Transcription factor GRAS
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005829  Cellular Component  cytosol
Sequence
Protein Sequence    Length: 361 aa
MQTSEKHKIS GKYFDQPVQE LESHCWPPNR SLDHYQSCSD DGGNGLQYSV QNLEQYCTLE  60
SSSSMQNSSS TASFSPSGSP VSQPNSQSYL SDVHHSPDNT CSSPVSGSCV TDNEHDLRHM  120
IRQLETAMLG TDSDNFDIHA INASGGATQI SIEEERWKYM MEMIARGDLK ELLCACAKAI  180
DNNDMHMADW LMTQLRQMVS VSGEPIQRLG AYMLEGLVAR LASSGSSIYK ALRCKEPAST  240
ELLSYMHILY EICPYFKFGY MSANGAIAEA MKDESRVHII DFQIAQGAQW LTLIHALAAR  300
PGGPPSIRIT GIDDPTSAYA RGGGLEIVGQ RLLKLAESCK VPFEFHSAAI SGTEVQLENL  360
G
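
The numbered sequence block above can be turned into a plain residue string and checked against the coordinates reported elsewhere in this record. The following is a minimal sketch, not part of PlantTFDB: the helper names `clean_sequence` and `slice_region` are hypothetical, and the coordinates 169-360 are taken from the Signature Domain table, read as 1-based and inclusive.

```python
# Sequence block copied from the record; the trailing integers are
# position counters, not residues.
RAW = """
MQTSEKHKIS GKYFDQPVQE LESHCWPPNR SLDHYQSCSD DGGNGLQYSV QNLEQYCTLE  60
SSSSMQNSSS TASFSPSGSP VSQPNSQSYL SDVHHSPDNT CSSPVSGSCV TDNEHDLRHM  120
IRQLETAMLG TDSDNFDIHA INASGGATQI SIEEERWKYM MEMIARGDLK ELLCACAKAI  180
DNNDMHMADW LMTQLRQMVS VSGEPIQRLG AYMLEGLVAR LASSGSSIYK ALRCKEPAST  240
ELLSYMHILY EICPYFKFGY MSANGAIAEA MKDESRVHII DFQIAQGAQW LTLIHALAAR  300
PGGPPSIRIT GIDDPTSAYA RGGGLEIVGQ RLLKLAESCK VPFEFHSAAI SGTEVQLENL  360
G
"""


def clean_sequence(raw: str) -> str:
    """Drop whitespace and position counters, keeping only residue letters."""
    return "".join(ch for ch in raw if ch.isalpha())


def slice_region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    return seq[start - 1:end]


protein = clean_sequence(RAW)
gras = slice_region(protein, 169, 360)  # GRAS domain span from the record

print(len(protein))   # total residues
print(len(gras))      # residues covered by the GRAS domain hit
print(gras[:15])      # should match the start of the query row in the alignment
```

The slice can be compared by eye with the query rows of the domain alignment, which begin at residue 169 and end at residue 360.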
3D Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5b3g_A  1e-30   176          345        26         192      Protein SCARECROW
5b3h_A  1e-30   176          345        25         191      Protein SCARECROW
5b3h_D  1e-30   176          345        25         191      Protein SCARECROW
Functional Description
Source   Description
UniProt  Probable transcription factor involved in plant development. {ECO:0000250}.
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_007020614.2  0.0      PREDICTED: scarecrow-like transcription factor PAT1
Refseq     XP_007020616.2  0.0      PREDICTED: scarecrow-like transcription factor PAT1
Refseq     XP_017980057.1  0.0      PREDICTED: scarecrow-like transcription factor PAT1
Refseq     XP_017980058.1  0.0      PREDICTED: scarecrow-like transcription factor PAT1
Swissprot  Q8H125          1e-124   SCL5_ARATH; Scarecrow-like protein 5
TrEMBL     A0A061F479      0.0      A0A061F479_THECC; Scarecrow-like transcription factor PAT1 isoform 3 (Fragment)
TrEMBL     A0A061F5H8      0.0      A0A061F5H8_THECC; Scarecrow-like transcription factor PAT1 isoform 1
STRING     EOY12139        0.0      (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G50600.1  1e-121   scarecrow-like 5
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]
  2. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]