Knots and $theta$-curves identification in polymeric chains and native proteins using neural networks

Item request has been placed!

Item request cannot be made.

Processing Request

Read More Add to Saved list

Author(s): da Silva, Fernando Bruno; Gabrovšek, Boštjan; Korpacz, Marta; Luczkiewicz, Kamil; Niewieczerzal, Szymon; Sikora, Maciej; Sulkowska, Joanna I.
Source:
Macromolecules, vol. 57, no. 9, pp. 4599-4608, 2024. ; ISSN: 0024-9297
Subject Terms:
machine learning; topology; protein databases; entanglements; open knots; closed knots; strojno učenje; topologija; proteinska baza podatkov; zavozlanost; odprti vozli; sklenjeni vozli; info:eu-repo/classification/udc/004.85:004.725.4
Document Type:
article in journal/newspaper
Language:
English

Additional Information
- Publication Information:
  American Chemical Society
- Publication Date:
  2025
- Collection:
  University of Ljubljana: Repository (RUJ) / Repozitorij Univerze v Ljubljani
- Abstract:
  Entanglement in proteins is a fascinating structural motif that is neither easy to detect via traditional methods nor fully understood. Recent advancements in AI-driven models have predicted that millions of proteins could potentially have a nontrivial topology. Herein, we have shown that long short-term memory (LSTM)-based neural networks (NN) architecture can be applied to detect, classify, and predict entanglement not only in closed polymeric chains but also in polymers and protein-like structures with open knots, actual protein configurations, and also ▫$theta$▫-curves motifs. The analysis revealed that the LSTM model can predict classes (up to the ▫$6_1$▫ knot) accurately for closed knots and open polymeric chains, resembling real proteins. In the case of open knots formed by protein-like structures, the model displays robust prediction capabilities with an accuracy of 99%. Moreover, the LSTM model with proper features, tested on hundreds of thousands of knotted and unknotted protein structures with different architectures predicted by AlphaFold 2, can distinguish between the trivial and nontrivial topology of the native state of the protein with an accuracy of 93%.
- File Description:
  application/pdf; text/url
- Relation:
  info:eu-repo/grantAgreement/ARIS//N1-0278-2023; info:eu-repo/grantAgreement/other/NCN - National Science Centre, Poland/2021%2F43%2FI%2FNZ1%2F03341; info:eu-repo/grantAgreement/other/NCN - National Science Centre, Poland/2022%2F47%2FB%2FNZ1%2F03480; https://plus.cobiss.net/cobiss/si/sl/bib/194735875
- Online Access:
  https://repozitorij.uni-lj.si/IzpisGradiva.php?id=166791
  https://repozitorij.uni-lj.si/Dokument.php?id=198703&dn=
  https://repozitorij.uni-lj.si/Dokument.php?id=198702&dn=
  https://plus.cobiss.net/cobiss/si/sl/bib/194735875
  https://hdl.handle.net/20.500.12556/RUL-166791
- Rights:
  http://creativecommons.org/licenses/by/4.0/ ; info:eu-repo/semantics/openAccess
- Accession Number:
  edsbas.FD8A43DC

Comments

No Comments.

Knots and $theta$-curves identification in polymeric chains and native proteins using neural networks

Contact

Follow us