Skip to content
Snippets Groups Projects
mmcif.rst 18.72 KiB

mmCIF File Format

The mmCIF file format is an alternate container for structural entities, also provided by the PDB. Here we describe how to load those files and how to deal with information provided above the common PDB format (:class:`MMCifInfo`, :class:`MMCifInfoCitation`, :class:`MMCifInfoTransOperation`, :class:`MMCifInfoBioUnit`, :class:`MMCifInfoStructDetails`).

Loading mmCIF Files

Categories Available

The following categories of a mmCIF file are considered by the reader:

  • atom_site: Used to build the :class:`entity <ost.mol.EntityHandle>`
  • entity: Involved in setting ChainTypes
  • entity_poly: Involved in setting ChainTypes
  • citation: Goes into :class:`MMCifInfoCitation`
  • citation_author: Goes into :class:`MMCifInfoCitation`
  • exptl: Goes into :class:`MMCifInfo` as :attr:`method <MMCifInfo.method>`.
  • refine: Goes into :class:`MMCifInfo` as :attr:`resolution <MMCifInfo.resolution>`.
  • pdbx_struct_assembly: Used for :class:`MMCifInfoBioUnit`.
  • pdbx_struct_assembly_gen: Used for :class:`MMCifInfoBioUnit`.
  • pdbx_struct_oper_list: Used for :class:`MMCifInfoBioUnit`.
  • struct: Details about a structure, stored in :class:`MMCifInfoStructDetails`.
  • struct_conf: Stores secondary structure information (practically helices) in the :class:`entity <ost.mol.EntityHandle>`
  • struct_sheet_range: Stores secondary structure information for sheets in the :class:`entity <ost.mol.EntityHandle>`
  • pdbx_database_PDB_obs_spr: Verbose information on obsoleted/ superseded entries, stored in :class:`MMCifInfoObsolete`.
  • struct_ref stored in :class:`MMCifInfoStructRef`
  • struct_ref_seq stored in :class:`MMCifInfoStructRefSeq`
  • struct_ref_seq_dif stored in :class:`MMCifInfoStructRefDif`

Info Classes

Information from mmCIF files which goes beyond structural data, is kept in a special container, the :class:`MMCifInfo` class. Here is a detailed description of the annotation available.

This is the container for all bits of non-molecular data pulled from a mmCIF file.

This stores citation information from an input file.

This stores operations needed to transform an :class:`entity <ost.mol.EntityHandle>` into a bio unit.

This stores information how a structure is to be assembled to form the bio unit.

Holds details about the structure.

Holds details on obsolete/ superseded structures.

Holds the information of the struct_ref category. The category describes the link of polymers in the mmCIF file to sequences stored in external databases such as uniprot. The related categories struct_ref_seq and struct_ref_seq_dif also list differences between the sequences of the deposited structure and the sequences in the database. A promintent example of such differences include point mutations and/or expression tags.

An aligned range of residues between a sequence in a reference database and the deposited sequence.

A particular difference between the deposited sequence and the sequence in the database.