-
Marco Biasini authored
The mmCIF reader now also parses struct_ref_seq and struct_ref_seq_dif. Includes unit tests and docs
Marco Biasini authoredThe mmCIF reader now also parses struct_ref_seq and struct_ref_seq_dif. Includes unit tests and docs
mmCIF File Format
The mmCIF file format is an alternate container for structural entities, also provided by the PDB. Here we describe how to load those files and how to deal with information provided above the common PDB format (:class:`MMCifInfo`, :class:`MMCifInfoCitation`, :class:`MMCifInfoTransOperation`, :class:`MMCifInfoBioUnit`, :class:`MMCifInfoStructDetails`).
Loading mmCIF Files
Categories Available
The following categories of a mmCIF file are considered by the reader:
-
atom_site
: Used to build the :class:`entity <ost.mol.EntityHandle>` -
entity
: Involved in setting ChainTypes -
entity_poly
: Involved in setting ChainTypes -
citation
: Goes into :class:`MMCifInfoCitation` -
citation_author
: Goes into :class:`MMCifInfoCitation` -
exptl
: Goes into :class:`MMCifInfo` as :attr:`method <MMCifInfo.method>`. -
refine
: Goes into :class:`MMCifInfo` as :attr:`resolution <MMCifInfo.resolution>`. -
pdbx_struct_assembly
: Used for :class:`MMCifInfoBioUnit`. -
pdbx_struct_assembly_gen
: Used for :class:`MMCifInfoBioUnit`. -
pdbx_struct_oper_list
: Used for :class:`MMCifInfoBioUnit`. -
struct
: Details about a structure, stored in :class:`MMCifInfoStructDetails`. -
struct_conf
: Stores secondary structure information (practically helices) in the :class:`entity <ost.mol.EntityHandle>` -
struct_sheet_range
: Stores secondary structure information for sheets in the :class:`entity <ost.mol.EntityHandle>` -
pdbx_database_PDB_obs_spr
: Verbose information on obsoleted/ superseded entries, stored in :class:`MMCifInfoObsolete`. -
struct_ref
stored in :class:`MMCifInfoStructRef` -
struct_ref_seq
stored in :class:`MMCifInfoStructRefSeq` -
struct_ref_seq_dif
stored in :class:`MMCifInfoStructRefDif`
Info Classes
Information from mmCIF files which goes beyond structural data, is kept in a special container, the :class:`MMCifInfo` class. Here is a detailed description of the annotation available.
This is the container for all bits of non-molecular data pulled from a mmCIF file.
This stores citation information from an input file.
This stores operations needed to transform an :class:`entity <ost.mol.EntityHandle>` into a bio unit.
This stores information how a structure is to be assembled to form the bio unit.
Holds details about the structure.
Holds details on obsolete/ superseded structures.
Holds the information of the struct_ref category. The category describes the
link of polymers in the mmCIF file to sequences stored in external databases
such as uniprot. The related categories struct_ref_seq
and
struct_ref_seq_dif
also list differences between the sequences of the
deposited structure and the sequences in the database. A promintent example of
such differences include point mutations and/or expression tags.
An aligned range of residues between a sequence in a reference database and the deposited sequence.
A particular difference between the deposited sequence and the sequence in the database.