Skip to content
Snippets Groups Projects
Commit 4fc41be9 authored by Bienchen's avatar Bienchen
Browse files

Refurbished the Input Requirements section of README.md

parent 911e25bf
No related branches found
No related tags found
No related merge requests found
...@@ -25,23 +25,27 @@ Input Requirements ...@@ -25,23 +25,27 @@ Input Requirements
------------------ ------------------
The container can read protein structures in PDB (.pdb) or mmCIF (.cif/.mmcif) The container can read protein structures in PDB (.pdb) or mmCIF (.cif/.mmcif)
format. Compressed files are accepted, too (.gz suffix, i.e. 1crn.pdb.gz). format. Compressed files are accepted, too (.gz suffix, e.g. 1crn.pdb.gz).
Providing SEQRES sequences is recommended, this is the actual protein sequence Providing SEQRES sequences (via the option `--seqres`) is recommended. SEQRES
which is not necessarily fully covered by the structural data. If not provided, describes the actual protein sequence of consecutively connected amino acids.
sequences are extracted from the structural data. SEQRES must be provided as one It is not necessarily completely covered by the structural data, think of
or several sequences in a single FASTA file. missing terminii in a model or gaps. If not provided, sequences are extracted
from the structural data. SEQRES must be provided as one or multiple sequences
in a single FASTA file.
* One sequence: All chains of the input structure(s) must align to it, * One sequence: All chains of the input structure(s) must align to it,
the structures are either monomers or homo-oligmers. the structures are either monomers or homo-oligmers.
* Several sequences: Required for hetero-oligomers, uses name based * Multiple sequences: Required for hetero-oligomers, uses name based
chain/SEQRES matching chain/SEQRES matching
The container calculates sequence profiles using The container calculates sequence profiles using
[HHblits](https://github.com/soedinglab/hh-suite) for each unique SEQRES HHblits ([DOI](https://doi.org/10.1186/s12859-019-3019-7),
sequence. If you already have the respective profiles available in a3m format, [Git](https://github.com/soedinglab/hh-suite)) for each unique SEQRES sequence.
you can speed things up. This only works if you also provide SEQRES as an input If you already have the respective profiles available in a3m format, you can
and the master sequence for each profile must match one of the SEQRES sequences. speed things up (via the option `--profiles`). This only works if you also
provide SEQRES as an input and the master sequence for each profile must match
one of the SEQRES sequences.
<a name="qmeanpull"></a>Obtain the image (Docker `pull`) <a name="qmeanpull"></a>Obtain the image (Docker `pull`)
-------------------------------------------------------- --------------------------------------------------------
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please register or to comment