From 9189be2fd0d561310cfae4e64a72e32eafae4dea Mon Sep 17 00:00:00 2001 From: Bienchen <stefan.bienert@unibas.ch> Date: Fri, 11 Nov 2022 09:45:30 +0000 Subject: [PATCH] Describe the handling of mismatching sequences in human-heterodimers-w-crosslinks --- projects/human-heterodimers-w-crosslinks/README.md | 5 ++--- 1 file changed, 2 insertions(+), 3 deletions(-) diff --git a/projects/human-heterodimers-w-crosslinks/README.md b/projects/human-heterodimers-w-crosslinks/README.md index 902d1d6..137f00a 100644 --- a/projects/human-heterodimers-w-crosslinks/README.md +++ b/projects/human-heterodimers-w-crosslinks/README.md @@ -4,10 +4,9 @@ **I.O.U. the link to the publication** -This project consists of around 800 heterodimer models for the human reference proteome. Modelling was done with [ColabFold](https://colabfold.mmseqs.com)/ [LocalColabFold](https://github.com/YoshitakaMo/localcolabfold). Model selection is special in a sense that for some heterodimers experimental crosslinking data is available guiding the choice, otherwise top-ranking models are used. +This project consists of around 800 dimer models (vast majority heteros) for the human reference proteome. Modelling was done with [ColabFold](https://colabfold.mmseqs.com)/ [LocalColabFold](https://github.com/YoshitakaMo/localcolabfold). Model selection is special in a sense that for some heterodimers experimental crosslinking data is available guiding the choice, otherwise top-ranking models are used. -- special for this set: has a few homomers -- special for this set: old versions of UniProtKB sequences are traced down to a matching version in the history/ using UniSave +Since some of the models were build for UniProtKB entries whose sequence were updated in the meantime, the conversion script goes down entry history until it finds a matching sequence. So the ModelCIF file will reference a version of the UniProtKB entry with the sequence used during modelling. <how are the ModelCIF files created using this software> -- GitLab