-
Studer Gabriel authoredStuder Gabriel authored
Structure comparison examples
OpenStructure provides "actions" for general benchmarking use cases.
- compare-structures: Focuses on comparisons involving polymer entities, i.e. protein, DNA and RNA chains
- compare-ligand-structures: Focuses on comparisons of interactions between polymer entities and non-polymer entities, i.e. small molecule ligands
The example commands here assume an OpenStructure installation (compile instructions: https://openstructure.org/docs/dev/install/). Detailed instructions for running equivalent computations are available for
- Docker: https://git.scicore.unibas.ch/schwede/openstructure/tree/master/docker
- Singularity https://git.scicore.unibas.ch/schwede/openstructure/tree/master/singularity
The command
ost <ACTION> -h
will list all available options of the respective action. Both actions compute scores on an “opt-in” basis and produce output in JSON format. Example to compute global and per-residue LDDT scores, as well as as QS-scores which are written to default output (out.json):
ost compare-structures -m model.pdb -r reference.cif.gz --lddt --local-lddt --qs-score
By default, model-reference chains are aligned using Needleman-Wunsch.
Many benchmarking efforts such as CASP and CAMEO assume residue numbers
according to target sequence(s). Both "actions" allow to derive model-reference
chain alignments according to these numbers which should be preferred in these
cases. This can be enabled by adding a -rna
(residue number alignment) flag:
ost compare-structures -m model.pdb -r reference.cif.gz --lddt --local-lddt --qs-score -rna
The same example also contains small molecule ligands which can be scored with:
ost compare-ligand-structures -m model.pdb -r reference.cif.gz -ml *.sdf --rmsd --lddt-pli
Again, it is advised to use the -rna
flag if applicable. In this example,
reference ligands are directly extracted from the provided mmCIF file based on
"non-polymer" entity types.
This only works in case of mmCIF input AND if the respective ligand is in the
PDB component dictionary which defines connectivity (matching based on compound
name).
Container solutions come with such a dictionary which has been created at build
time. Check the Docker/Singularity instructions linked above on how to set the
latest dictionary, i.e. "Compound Library".
You can override automatic extraction by providing SDF files with ligand
coordinates and connectivity information. If the receptor is provided in
PDB format, ligands must be provided in SDF format.