PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG041814t4
Common NameTCM_041814
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family GRAS
Protein Properties Length: 677aa    MW: 75541.6 Da    PI: 4.7641
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG041814t4genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS258.52.4e-794116771266
              GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkf 93 
                       l++lL+ cA+avs++d + a +lL++++e++sp gd +qRla++f+++L+arl +s++ + +   +s +s+++ ++ l+a++++  ++P+ k+
  Thecc1EG041814t4 411 LRTLLILCAQAVSADDRRTAGELLKQIKEHSSPLGDGTQRLAHFFANGLEARLDGSGTAIQNLY-SSLASKTTAADMLKAYQVYLCACPFKKL 502
                       5789***************************************************888887655.555556889******************* PP

              GRAS  94 shltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvak 184
                       s + aN++I   +e+++++Hi+Df+i +G+QWp L+q L++Rp+gpp+lRiTg++ p+ g   +e++eetg+rL++++++++vpfe+n+++a+
  Thecc1EG041814t4 503 SIFFANKMIWHMAEKASALHIVDFGILYGFQWPILIQHLSKRPGGPPKLRITGIEIPQRGfrPAERIEETGRRLERYCKRFDVPFEYNPMAAQ 595
                       **********************************************************99******************************9** PP

              GRAS 185 rledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysa 266
                       ++e++++e++++k++E+laVn+ ++ ++llde+++++ +r++vLkl+++++P+++v++  + ++n++ Fl+rf eal + sa
  Thecc1EG041814t4 596 NWETIQVEDIKIKSNEMLAVNCLFRFKNLLDETAEVDCPRNAVLKLIRKMNPDIFVHSIDNGSYNAPFFLTRFREALFHLSA 677
                       *****************************************************************************99875 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098552.11385677IPR005202Transcription factor GRAS
PfamPF035148.3E-77411677IPR005202Transcription factor GRAS
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
Sequence ? help Back to Top
Protein Sequence    Length: 677 aa     Download sequence    Send to blast
MVMDPKFTEF TDYINGFGVE DDALLFTSGQ YPNFTNGLEF NVSSPDLGFM SANVPVIPPN  60
PDPGISVPPA TVSSDGSSFS ASTGWSPDGE SSSPSDDSDS TDPVLKYIRQ MLMEENMEDK  120
PFMFNDYLAL EDTEKSLYEV LGEQYPPSNQ PQPFLNVNVE SPDSNLSGNS RDNGSNSNST  180
TSISTSNGTS NYIDHWGVGE VVEHAPSLLQ APLSGDYHFQ SNLQQPSSQF SVNSTNSSSN  240
MGNGLMESSL SELLVQNIFS DKESVLQFQR GFEEASKFLP SSNQLIIDLE SNKFPMVQKG  300
KVPNLVVKVE KDERENSPDE LRGRKNHERD DGGLEEERSN KQSAVYTEES DLSDMFDKVL  360
LCTDGKAMCG YNKALQQGET KTLLQKEQSN ESSVGKTRSK KQEKKKETVD LRTLLILCAQ  420
AVSADDRRTA GELLKQIKEH SSPLGDGTQR LAHFFANGLE ARLDGSGTAI QNLYSSLASK  480
TTAADMLKAY QVYLCACPFK KLSIFFANKM IWHMAEKASA LHIVDFGILY GFQWPILIQH  540
LSKRPGGPPK LRITGIEIPQ RGFRPAERIE ETGRRLERYC KRFDVPFEYN PMAAQNWETI  600
QVEDIKIKSN EMLAVNCLFR FKNLLDETAE VDCPRNAVLK LIRKMNPDIF VHSIDNGSYN  660
APFFLTRFRE ALFHLSA
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_A1e-363966774272Protein SCARECROW
5b3h_A1e-363966773271Protein SCARECROW
5b3h_D1e-363966773271Protein SCARECROW
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in plant development. {ECO:0000250}.
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_021278110.10.0scarecrow-like protein 14
RefseqXP_021278112.10.0scarecrow-like protein 14
RefseqXP_021278113.10.0scarecrow-like protein 14
RefseqXP_021278114.10.0scarecrow-like protein 14
SwissprotQ9XE580.0SCL14_ARATH; Scarecrow-like protein 14
TrEMBLA0A061GXR90.0A0A061GXR9_THECC; GRAS family transcription factor, putative isoform 4 (Fragment)
STRINGEOY339990.0(Theobroma cacao)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G07530.10.0SCARECROW-like 14
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]