diff --git a/projects/human-heterodimers-w-crosslinks/README.md b/projects/human-heterodimers-w-crosslinks/README.md index 902d1d68c712d794f11d35eb0dcd6f5b2555478d..137f00a256dfea6b52b484b80d12d2e3f7003c62 100644 --- a/projects/human-heterodimers-w-crosslinks/README.md +++ b/projects/human-heterodimers-w-crosslinks/README.md @@ -4,10 +4,9 @@ **I.O.U. the link to the publication** -This project consists of around 800 heterodimer models for the human reference proteome. Modelling was done with [ColabFold](https://colabfold.mmseqs.com)/ [LocalColabFold](https://github.com/YoshitakaMo/localcolabfold). Model selection is special in a sense that for some heterodimers experimental crosslinking data is available guiding the choice, otherwise top-ranking models are used. +This project consists of around 800 dimer models (vast majority heteros) for the human reference proteome. Modelling was done with [ColabFold](https://colabfold.mmseqs.com)/ [LocalColabFold](https://github.com/YoshitakaMo/localcolabfold). Model selection is special in a sense that for some heterodimers experimental crosslinking data is available guiding the choice, otherwise top-ranking models are used. -- special for this set: has a few homomers -- special for this set: old versions of UniProtKB sequences are traced down to a matching version in the history/ using UniSave +Since some of the models were build for UniProtKB entries whose sequence were updated in the meantime, the conversion script goes down entry history until it finds a matching sequence. So the ModelCIF file will reference a version of the UniProtKB entry with the sequence used during modelling. <how are the ModelCIF files created using this software>