PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID C.cajan_24676
Common Name KK1_025039
Organism Cajanus cajan
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Cajanus
Family Trihelix
Protein Properties Length: 328 aa    MW: 37079.6 Da    pI: 9.8669
Description Trihelix family protein
Gene Model
Gene Model ID   Type    Source  Coding Sequence
C.cajan_24676   genome  IIPG    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  56.6   6.8e-18  43     141  2          86
       trihelix   2 WtkqevlaLiearr.emeerlrrgk.............lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfd 83 
                    Wt qe+l Li+a++ ++e+rl+++              + + +W++v ++++++g+ rs++qC++kw+nl ++ykk+++ e k+++ +++++p + 
  C.cajan_24676  43 WTIQETLILITAKKlDDERRLKTSHdptraacsttartSGELRWKWVENYCWSHGCLRSQNQCNDKWDNLLRDYKKVRDYEFKQQQSNEKHFPSYW 138
                    *************944444444443566667777887799*******************************************9999999999999 PP

       trihelix  84 qle 86 
                    +l+
  C.cajan_24676 139 NLN 141
                    886 PP

Protein Features
3D Structure
Database         Entry ID          E-value  Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60  2.2E-5   34     115  IPR009057    Homeodomain-like
PROSITE profile  PS50090           6.771    35     113  IPR017877    Myb-like domain
Pfam             PF13837           1.4E-11  42     141  No hit       No description
SuperFamily      SSF101447         2.51E-5  205    215  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 328 aa
MSDPSTTPLP PPPLLPSPHH LIQGGATASS SSSLAREYRK GNWTIQETLI LITAKKLDDE  60
RRLKTSHDPT RAACSTTART SGELRWKWVE NYCWSHGCLR SQNQCNDKWD NLLRDYKKVR  120
DYEFKQQQSN EKHFPSYWNL NKQQRKEHNL PSNMVFDVYQ AITEVLQRKQ TQPQAQTQRQ  180
PAVTLVTSSP LQTLPPPPPP PPPPPPPPPP PPPPPVSSAT QAVSERSESS GTEHSEDDDG  240
SESKRRKVKN LGSSIMRSAS VLARALRSCE EKKEKRHREL IELEQRRIQM EEARNEVHRQ  300
GIATLVAAVT NLSGAIQSLI NSERHGQR
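The sequence above is printed in blocks of ten residues with a running position counter at the end of each full line. As a minimal sketch (the helper names `clean_sequence` and `subsequence` are hypothetical and not part of PlantTFDB), the block can be joined into a plain string, checked against the stated length of 328 aa, and sliced at the signature-domain coordinates 43 to 141:

```python
import re

# Sequence block copied from this record. The numbers at the end of each
# full line are position counters, not residues.
RAW = """
MSDPSTTPLP PPPLLPSPHH LIQGGATASS SSSLAREYRK GNWTIQETLI LITAKKLDDE  60
RRLKTSHDPT RAACSTTART SGELRWKWVE NYCWSHGCLR SQNQCNDKWD NLLRDYKKVR  120
DYEFKQQQSN EKHFPSYWNL NKQQRKEHNL PSNMVFDVYQ AITEVLQRKQ TQPQAQTQRQ  180
PAVTLVTSSP LQTLPPPPPP PPPPPPPPPP PPPPPVSSAT QAVSERSESS GTEHSEDDDG  240
SESKRRKVKN LGSSIMRSAS VLARALRSCE EKKEKRHREL IELEQRRIQM EEARNEVHRQ  300
GIATLVAAVT NLSGAIQSLI NSERHGQR
"""


def clean_sequence(raw: str) -> str:
    """Keep only uppercase letters, dropping spaces, digits and newlines."""
    return re.sub(r"[^A-Z]", "", raw)


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


protein = clean_sequence(RAW)
domain = subsequence(protein, 43, 141)  # trihelix signature domain

print(len(protein))             # 328
print(len(domain))              # 99
print(domain[:7], domain[-3:])  # WTIQETL NLN
```

The first and last residues of the slice match the query line of the trihelix alignment in the Signature Domain section, which starts at position 43 with WTIQETL and ends at position 141 with NLN.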
Cis-element
Source       Link
PlantRegMap  C.cajan_24676
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  KF221013  4e-98    KF221013.1 Glycine max cultivar Kwangkyo clone 1f1r hydroxyproline-rich glycoprotein family protein (Glyma20g32540) gene, partial cds.
GenBank  KF221014  4e-98    KF221014.1 Glycine max cultivar Ilpumgeomjeong clone 1f1r hydroxyproline-rich glycoprotein family protein (Glyma20g32540) gene, partial cds.
GenBank  KF221016  4e-98    KF221016.1 Glycine max cultivar PI96983 clone 1f1r hydroxyproline-rich glycoprotein family protein (Glyma20g32540) gene, partial cds.
GenBank  KF221018  4e-98    KF221018.1 Glycine max cultivar Geomjeongol clone 1f1r hydroxyproline-rich glycoprotein family protein (Glyma20g32540) gene, partial cds.
GenBank  KF221020  4e-98    KF221020.1 Glycine soja cultivar IT162825 clone 1f1r hydroxyproline-rich glycoprotein family protein (Glyma20g32540) gene, partial cds.
GenBank  KF221023  4e-98    KF221023.1 Glycine soja cultivar IT182840 clone 1f1r hydroxyproline-rich glycoprotein family protein (Glyma20g32540) gene, partial cds.
GenBank  KF221024  4e-98    KF221024.1 Glycine soja cultivar IT182848 clone 1f1r hydroxyproline-rich glycoprotein family protein (Glyma20g32540) gene, partial cds.
GenBank  KF221025  4e-98    KF221025.1 Glycine soja cultivar IT182932 clone 1f1r hydroxyproline-rich glycoprotein family protein (Glyma20g32540) gene, partial cds.
Annotation -- Protein
Source  Hit ID           E-value  Description
Refseq  XP_020229739.1   0.0      trihelix transcription factor GTL1
TrEMBL  A0A151SE87       0.0      A0A151SE87_CAJCA; Uncharacterized protein
STRING  GLYMA20G32540.2  1e-147   (Glycine max)
STRING  XP_007143775.1   1e-147   (Phaseolus vulgaris)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Fabids   OGEF6804              32           42
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G35640.1  2e-45    Trihelix family protein