NDB ID: 5VI5 PDB ID: 5VI5 
Title:
STRUCTURE OF MYCOBACTERIUM SMEGMATIS TRANSCRIPTION INITIATION COMPLEX WITH A FULL TRANSCRIPTION BUBBLE
Molecular Description:
DNA-directed RNA polymerase subunit alpha
DNA-directed RNA polymerase subunit beta
DNA-directed RNA polymerase subunit beta'
DNA-directed RNA polymerase subunit omega
RNA polymerase sigma factor SigA
RNA polymerase-binding protein RbpA/DNA Complex
Deposited:
2017-04-14
Released:
2017-07-26
Structural Keywords:
RH DOUBLE HELIX
Nucleic Acid Sequence:
Chain O
(DG)(DC)(DT)(DT)(DG)(DA)(DC)(DA)(DA)(DA)(DA)(DG)(DT)(DG)(DT)(DT)(DA)(DA)(DA)(DT)
(DT)(DG)(DT)(DG)(DC)(DT)(DA)(DT)(DA)(DC)(DT)(DG)(DG)(DG)(DA)(DG)(DC)(DC)(DG)(DT)
(DC)(DA)(DC)(DG)(DG)(DA)(DT)(DG)(DC)(DG)
Chain Q
UCGA
Chain P
(DC)(DG)(DC)(DA)(DT)(DC)(DC)(DG)(DT)(DG)(DA)(DG)(DT)(DC)(DG)(DA)(DG)(DG)(DA)(DT)
(DA)(DA)(DT)(DA)(DA)(DG)(DC)(DA)(DC)(DA)(DA)(DT)(DT)(DT)(DA)(DA)(DC)(DA)(DC)(DT)
(DT)(DT)(DT)(DG)(DT)(DC)(DA)(DA)(DG)(DC)
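The parenthesized deoxynucleotide notation used for Chains O and P can be collapsed to one-letter codes for easier comparison. The following is a minimal Python sketch, not part of any NDB or PDB tooling; the helper names (`to_one_letter`, `reverse_complement`) are illustrative assumptions, and the sequences are copied from the entries above.

```python
import re

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

def to_one_letter(parenthesized):
    """Convert '(DG)(DC)...' notation to a one-letter DNA string."""
    compact = re.sub(r"\s+", "", parenthesized)  # drop block separators and line breaks
    return "".join(re.findall(r"\(D([ACGT])\)", compact))

def reverse_complement(seq):
    """Return the reverse complement of a one-letter DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))

# Sequences as listed in this entry (20-residue blocks).
CHAIN_O = (
    "(DG)(DC)(DT)(DT)(DG)(DA)(DC)(DA)(DA)(DA)(DA)(DG)(DT)(DG)(DT)(DT)(DA)(DA)(DA)(DT) "
    "(DT)(DG)(DT)(DG)(DC)(DT)(DA)(DT)(DA)(DC)(DT)(DG)(DG)(DG)(DA)(DG)(DC)(DC)(DG)(DT) "
    "(DC)(DA)(DC)(DG)(DG)(DA)(DT)(DG)(DC)(DG)"
)
CHAIN_P = (
    "(DC)(DG)(DC)(DA)(DT)(DC)(DC)(DG)(DT)(DG)(DA)(DG)(DT)(DC)(DG)(DA)(DG)(DG)(DA)(DT) "
    "(DA)(DA)(DT)(DA)(DA)(DG)(DC)(DA)(DC)(DA)(DA)(DT)(DT)(DT)(DA)(DA)(DC)(DA)(DC)(DT) "
    "(DT)(DT)(DT)(DG)(DT)(DC)(DA)(DA)(DG)(DC)"
)

chain_o = to_one_letter(CHAIN_O)
chain_p = to_one_letter(CHAIN_P)

# 1-based positions in Chain P that do not pair with the reverse complement of Chain O.
mismatches = [
    i + 1
    for i, (p, rc) in enumerate(zip(chain_p, reverse_complement(chain_o)))
    if p != rc
]

print(chain_o)
print(chain_p)
print(mismatches)
```

Positions where Chain P differs from the reverse complement of Chain O mark the stretch in which the two strands are not complementary, which is where an unpaired (bubble) region would be expected.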
Protein Sequence:
Chain C
MLEGCILAVSSQSKSNAITNNSVPGAPNRVSFAKLREPLEVPGLLDVQTDSFEWLVGSDRWRQAAIDRGEENPVGGLEEV LAELSPIED
FSGSMSLSFSDPRFDEVKASVDECKDKDMTYAAPLFVTAEFINNNTGEIKSQTVFMGDFPMMTEKGTFIIN GTERVVVSQLVRSPGVYF
DETIDKSTEKTLHSVKVIPGRGAWLEFDVDKRDTVGVRIDRKRRQPVTVLLKALGWTNENIV ERFGFSEIMMGTLEKDTTSGTDEALLD
IYRKLRPGEPPTKESAQTLLENLFFKEKRYDLARVGRYKVNKKLGLNAGKPIT SSTLTEEDVVATIEYLVRLHEGQTSMTVPGGVEVPV
EVDDIDHFGNRRLRTVGELIQNQIRVGLSRMERVVRERMTTQDV EAITPQTLINIRPVVAAIKEFFGTSQLSQFMDQNNPLSGLTHKRR
LSALGPGGLSRERAGLEVRDVHPSHYGRMCPIETP EGPNIGLIGSLSVYARVNPFGFIETPYRKVENGVVTDQIDYLTADEEDRHVVAQ
ANSPTDENGRFTEDRVMVRKKGGEVE FVSADQVDYMDVSPRQMVSVATAMIPFLEHDDANRALMGANMQRQAVPLVRSEAPLVGTGMEL
RAAIDAGDVVVADKTGV IEEVSADYITVMADDGTRQSYRLRKFARSNHGTCANQRPIVDAGQRVEAGQVIADGPCTQNGEMALGKNLLV
AIMPWEGH NYEDAIILSNRLVEEDVLTSIHIEEHEIDARDTKLGAEEITRDIPNVSDEVLADLDERGIVRIGAEVRDGDILVGKVTPK
GETELTPEERLLRAIFGEKAREVRDTSLKVPHGESGKVIGIRVFSREDDDELPAGVNELVRVYVAQKRKISDGDKLAGRH GNKGVIGKI
LPVEDMPFLPDGTPVDIILNTHGVPRRMNIGQILETHLGWVAKAGWNIDVAAGVPDWASKLPEELYSAPAD STVATPVFDGAQEGELAG
LLGSTLPNRDGEVMVDADGKSTLFDGRSGEPFPYPVTVGYMYILKLHHLVDDKIHARSTGPY SMITQQPLGGKAQFGGQRFGEMECWAM
QAYGAAYTLQELLTIKSDDTVGRVKVYEAIVKGENIPEPGIPESFKVLLKELQ SLCLNVEVLSSDGAAIEMRDGDDEDLERAAANLGIN
LSRNESASVEDLA
Chain A,B
MLISQRPTLSEETVAENRSRFVIEPLEPGFGYTLGNSLRRTLLSSIPGAAVTSIRIDGVLHEFTTVPGVKEDVTDIILNL KGLVVSSDD
DEPVTMYLRKQGPGVVTAGDIVPPAGVTVHNPDMHIATLNDKGKLEVELVVERGRGYVPAVQNKASGAEIG RIPVDSIYSPVLKVTYKV
EATRVEQRTDFDKLIIDVETKNSISPRDALASAGGTLVELFGLARELNADSEHIEIGPSPAE ADHIASFALPIDDLDLTVRSYNCLKRE
GVHTVGELVARTESDLLDIRNFGQKSIDEVKIKLHQLGLSLKDSPATFDPSEV AGYDAATGTWTSDAGYDLDDNQDYAETEQL
Chain E
MSTPHADAQLNAADDLGIDSSAASAYDTPLGITNPPIDELLSRASSKYALVIYAAKRARQINDYYNQLGDGILEYVGPLV EPGLQEKPL
SIALREIHGDLLEHTEGE
Chain D
MLDVNFFDELRIGLATADDIRNWSYGEVKKPETINYRTLKPEKDGLFCEKIFGPTRDWECYCGKYKRVRFKGIICERCGV EVTRAKVRR
ERMGHIELAAPVTHIWYFKGVPSRLGYLLDLAPKDLEKIIYFAAYVITSVDDEMRHNELSTLEAEMAVEKK AVEDQRDADLEARAQKLE
ADLAELEAEGAKSDVRRKVRDSGEREMRQLRDRAQRELDRLDEIWNTFTKLAPKQLIVDEVL YRELQDRYGEYFTGAMGAESIKKLIEN
FDIDAEAESLREVIRSGKGQKKLRALKRLKVVAAFQQSGNSPMGMVLDAVPVI PPELRPMVQLDGGRFATSDLNDLYRRVINRNNRLKR
LIDLGAPEIIVNNEKRMLQESVDALFDNGRRGRPVTGPGNRPLK SLSDLLKGKQGRFRQNLLGKRVDYSGRSVIVVGPQLKLHQCGLPK
LMALELFKPFVMKRLVDLNHAQNIKSAKRMVERQR PQVWDVLEEVIAEHPVLLNRAPTLHRLGIQAFEPQLVEGKAIQLHPLVCEAFNA
DFDGDQMAVHLPLSAEAQAEARILML SSNNILSPASGKPLAMPRLDMVTGLYYLTTLVEGATGEYQAATKDAPEQGVYSSPAEAIMAMD
RGALSVRAKIKVRLTEL RPPTDLEAQLFENGWKPGDAWTEETTLGRVMFNELLPKSYPFVNEQMHKKVQARIINDLAERFPMIVVAQTV
DKLKDAGF YWATRSGVTVSMADVLVPPQKQEILERHEAEADAIERKYQRGALNHTERNESLVKIWQDATEEVGKALEEFYPADNPIIT
IVKSGATGNLTQTRTLAGMKGLVTNPKGEFIPRPIKSSFREGLTVLEYFINTHGARKGLADTALRTADSGYLTRRLVDVS QDVIVREHD
CETERGINVTLAERGPDGTLIRDAHVETSAFARTLATDAVDANGNVIIERGHDLGDPAIDALLAAGITTVK VRSVLTCTSATGVCAMCY
GRSMATGKLVDIGEAVGIVAAQSIGEPGTQLTMRTFHQGGVTGGADIVGGLPRVQELFEARV PRNKAPIADVAGRVRLEESDKFFKITI
VPDDGGEEVVYDKLSKRQRLRVITHEDGTEGVLSDGDHVEVGDQLMEGAADPH EVLRVQGPREVQIHLVKEVQEVYRAQGVSIHDKHIE
VIVRQMLRRVTIIDSGSTEFLPGSLTERAEFEAENRRVVAEGGE PAAGRPVLMGITKASLATDSWLSAASFQETTRVLTDAAINCRSDK
LNGLKENVIIGKLIPAGTGISRYRNINVQPTEEAR AAAYTIPSYEDQYYSPDFGQATGAAVPLDDYGYSDYR
Chain J
MADRVLRGSRLGAVSYETDRNHDLAPRQVARYRTDNGEEFDVPFADDAEIPGTWLCRNGLEGTLIEGDVPEPKKVKPPRT HWDMLLERR
SVEELEELLKERLDLIKAKRRGTGS
Chain F
MAATKASPATEEPVKRTATKTPAKKAPAKRAAKSAAAKAGGKAPAKKAPAKRAAKGTAAKPEDGVTDDLEVTDDLEAEPG EDLDVEDTD
LELDDLDSDDDTAVEDEEEEADAATPAVATAKAADDDIDEPSEKDKASGDFVWDEEESEALRQARKDAELT ASADSVRAYLKQIGKVAL
LNAEEEVELAKRIEAGLYATQKLAELAEKGEKLPVQQRRDMQWICRDGDRAKNHLLEANLRL VVSLAKRYTGRGMAFLDLIQEGNLGLI
RAVEKFDYTKGYKFSTYATWWIRQAITRAMADQARTIRIPVHMVEVINKLGRI QRELLQDLGREPTPEELAKEMDITPEKVLEIQQYAR
EPISLDQTIGDEGDSQLGDFIEDSEAVVAVDAVSFTLLQDQLQS VLETLSEREAGVVRLRFGLTDGQPRTLDEIGQVYGVTRERIRQIE
SKTMSKLRHPSRSQVLRDYLD
Primary Citation:
Hubin, E.A., Lilic, M., Darst, S.A., Campbell, E.A.
Structural insights into the mycobacteria transcription initiation complex from analysis of X-ray crystal structures. 
Nat Commun, 8, 16072, 2017.
Experimental Information:
X-RAY DIFFRACTION
Space Group:
P 1 21 1
Cell Constants:
a = 132.062 b = 163.555 c = 139.964 (Ångstroms)
α = 90.0 β = 107.9 γ = 90.0 (degrees)
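As a worked check on the cell constants, the unit-cell volume follows from the general triclinic formula V = abc·sqrt(1 − cos²α − cos²β − cos²γ + 2·cosα·cosβ·cosγ), which reduces to V = abc·sin β when α = γ = 90°, as here. The Python sketch below is illustrative only; the function name `unit_cell_volume` is an assumption, not something defined by the entry.

```python
import math

def unit_cell_volume(a, b, c, alpha_deg, beta_deg, gamma_deg):
    """Unit-cell volume in cubic Ångstroms from cell lengths (Å) and angles (degrees)."""
    ca, cb, cg = (
        math.cos(math.radians(angle)) for angle in (alpha_deg, beta_deg, gamma_deg)
    )
    return a * b * c * math.sqrt(1 - ca**2 - cb**2 - cg**2 + 2 * ca * cb * cg)

# Cell constants as listed in this entry.
volume = unit_cell_volume(132.062, 163.555, 139.964, 90.0, 107.9, 90.0)
print(f"{volume:.0f} cubic Ångstroms")
```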
Refinement:
The structure was refined using the PHENIX program.
The R value is 0.254 for 87525 reflections in the resolution range 50.015 to 3.196 Ångstroms with Fobs > 1.33 sigma(Fobs) and with I > 0.0 sigma(I).