PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID 676789994
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Sisymbrieae; Sisymbrium
Family bHLH
Protein Properties Length: 1416aa    MW: 158711 Da    PI: 6.0738
Description bHLH family protein
Gene Model
Gene Model ID Type Source Coding Sequence
676789994genomeVEGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HLH29.81.1e-092473354
               HHHHHHHHHHHHHHHHHHHHHHCTSCC.C...TTS-STCHHHHHHHHHHHHHH CS
        HLH  3 rahnerErrRRdriNsafeeLrellPk.askapskKlsKaeiLekAveYIksL 54
                 +  rE+ RR+++N+ f eL  +l k +   + +K++Ka+iL+ +v+ +k+L
  676789994 24 SQKAGREKLRREKLNEHFLELGNVLGKdP---DRPKNDKATILTDTVQLLKEL 73
               566779*********************66...9999**************998 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:4.10.280.103.9E-112196IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
PROSITE profilePS5088813.4022173IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
CDDcd000831.61E-72277No hitNo description
SuperFamilySSF474599.16E-122297IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
PfamPF000103.1E-72373IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
SMARTSM003538.3E-82779IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
SuperFamilySSF483717.85E-8371501IPR016024Armadillo-type fold
SuperFamilySSF483717.85E-8538656IPR016024Armadillo-type fold
SuperFamilySSF483717.85E-8778932IPR016024Armadillo-type fold
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0046983Molecular Functionprotein dimerization activity
Sequence ? help Back to Top
Protein Sequence    Length: 1416 aa     Download sequence    Send to blast
MCAHVPLIDP LCVHKVMDVS ARKSQKAGRE KLRREKLNEH FLELGNVLGK DPDRPKNDKA  60
TILTDTVQLL KELTSEVNKL KSEYTALTDE SRELTQEKND LREEKTSLKS DIENLNLQYQ  120
QRLRSMSPWG AAMDHTVMMA PPPSFPYPMP MAMPPGSIPM HPSMPSYPYF GNQNPSMMPA  180
PCTTYMPYMP PNTVVEQQSV HIPQNPSNRS REPRTKTSRE SRSEKAEDSN DVATQLELKT  240
PGSTSDKDTS SQRPEKTKRS KRNSNNNNSV EESSHSSKCS SSPSPHSNRT LGRRMTTMTP  300
EKTPARPLSI QDWDVLIEDF QDAGAPRDWF TAVFSTDSLV DFALSSLLKK EFPLQAKLSI  360
LVFLDEFSET LFDKCGNETF DRFIDALRAI VQSPTDGSSG LKEQAMISFT SVLVSIDSFS  420
VRHVEAVVDL LLALVNRPNH GFDRQARAIA CECLRQLERA FPGLLSDVAG HLWSLCQAER  480
THAVQAYLLL FTTVVYNVVN QKLGVSLLST SVPLVPFNAP NWIRDQSSVP GQGQSQGLGP  540
DQKELRRTLA FMLESPYLFT SCAMMEFMGM VVPLASALEL QASMLKVQFL GMIYSFDPML  600
CHVVLLMYSQ FPDAFEGQEK EIMRRLMLLS KETQIYLVFR LLALHWLMGL LNKLMLSGEL  660
GKRKSVLEMG QKFHPVVFDP LALKALKLDL LVQCSVSSNA LGGGDNSKSA GELLQECLES  720
VSDFKWLPPW SSETALAFRT LHKFLICAST HSDSDPSTTR SLMESSLFQN LQGLLVEMTL  780
KFEILVPVIV AFIERLINCQ KHQWLGERFL QTIDEKLLPK LEKNNLLTAY FPLFHRIAEN  840
DTIPPSRLIE LLTKFVVSLV DKRGFDVGLK LWDQGTEVLG ICRTLLSHHK SSRLFLGLSR  900
LLSLMCLYFP DLEVRDNARI YLRMLVCIPG RKIKNILKPA DTVSPSTHSS TFFTVQSPRF  960
RHDPNKSWNL SSYIHLERVT PLLVKQSWSL SLPSVGFGND GYSVIESKIQ VDEVEPESSQ  1020
ELQLLPDSRR IESGKPTLRV MDAKIAEILE RLRRYFSVIP DFRHMPGIKV RITCTLRLDA  1080
EPYSSIWESQ TQSTELDKVD TPPAIFATVL KFSSSAPYGS IPSCRIPFLL GEPHLDKNVP  1140
NDEGSLDIVL LENTRKEEEK DGLGGVPVTV ELEPREPTPG LVEVSMEANA ENGQMIQGIL  1200
ESVPVGIEDM FLKALAPPNV PEDTIPSYYS NLFNALWEVC GSSSSTAHET FALKGGKTAA  1260
AISGTRSVKL LEVPAETVIQ ASELHLAPFV VAITGEQLVN IVRDGGTIEN IVWQEEEEEE  1320
AQATADRSTS SSVGATGSNR CPLRLTYIGY GDDQQIPMTR SRGKMGKIKM LMFLPQRYHL  1380
LFEMEVGEGS TLVHIRTDYW PCLAYVDDYL EALFLH
Cis-element ? help Back to Top
SourceLink
PlantRegMap676789994
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAB0256310.0AB025631.1 Arabidopsis thaliana genomic DNA, chromosome 3, P1 clone: MPN9.
GenBankCP0026860.0CP002686.1 Arabidopsis thaliana chromosome 3, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006406446.10.0AP-5 complex subunit beta-1
TrEMBLA0A1J3E1D00.0A0A1J3E1D0_NOCCA; AP-5 complex subunit beta-1 (Fragment)
STRINGXP_006406446.10.0(Eutrema salsugineum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM1130422
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G19860.21e-137bHLH family protein