English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Transfer Learning Allows Accurate RBP Target Site Prediction with Limited Sample Sizes

Vaculík, O., Chalupová, E., Grešová, K., Majtner, T., & Alexiou, P. (2023). Transfer Learning Allows Accurate RBP Target Site Prediction with Limited Sample Sizes. Biology, 12(10): 1276. doi:10.3390/biology12101276.

Item is

Files

show Files
hide Files
:
biology-12-01276.pdf (Any fulltext), 3MB
Name:
biology-12-01276.pdf
Description:
-
OA-Status:
Not specified
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-

Locators

show

Creators

show
hide
 Creators:
Vaculík, Ondřej1, 2, Author
Chalupová, Eliška2, Author
Grešová, Katarína1, 2, Author
Majtner, Tomáš1, 3, Author                 
Alexiou, Panagiotis1, 4, 5, Author
Affiliations:
1Central European Institute of Technology (CEITEC), Masaryk University, Brno, Czech Republic, ou_persistent22              
2Faculty of Science, National Centre for Biomolecular Research, Masaryk University, Brno, Czech Republic, ou_persistent22              
3Department of Molecular Sociology, Max Planck Institute of Biophysics, Max Planck Society, ou_3040395              
4Department of Applied Biomedical Science, Faculty of Health Sciences, University of Malta, Msida, Malta, ou_persistent22              
5Centre for Molecular Medicine & Biobanking, University of Malta, Msida, Malta, ou_persistent22              

Content

show
hide
Free keywords: CLIP-seq, deep learning, interpretation, RNA-binding protein, transfer learning
 Abstract: RNA-binding proteins are vital regulators in numerous biological processes. Their disfunction can result in diverse diseases, such as cancer or neurodegenerative disorders, making the prediction of their binding sites of high importance. Deep learning (DL) has brought about a revolution in various biological domains, including the field of protein–RNA interactions. Nonetheless, several challenges persist, such as the limited availability of experimentally validated binding sites to train well-performing DL models for the majority of proteins. Here, we present a novel training approach based on transfer learning (TL) to address the issue of limited data. Employing a sophisticated and interpretable architecture, we compare the performance of our method trained using two distinct approaches: training from scratch (SCR) and utilizing TL. Additionally, we benchmark our results against the current state-of-the-art methods. Furthermore, we tackle the challenges associated with selecting appropriate input features and determining optimal interval sizes. Our results show that TL enhances model performance, particularly in datasets with minimal training data, where satisfactory results can be achieved with just a few hundred RNA binding sites. Moreover, we demonstrate that integrating both sequence and evolutionary conservation information leads to superior performance. Additionally, we showcase how incorporating an attention layer into the model facilitates the interpretation of predictions within a biologically relevant context.

Details

show
hide
Language(s): eng - English
 Dates: 2023-09-192023-08-152023-09-212023-09-25
 Publication Status: Published online
 Pages: 19
 Publishing info: -
 Table of Contents: -
 Rev. Type: Peer
 Identifiers: DOI: 10.3390/biology12101276
BibTex Citekey: vaculik_transfer_2023
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: Biology
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: Basel, Switzerland : MDPI
Pages: - Volume / Issue: 12 (10) Sequence Number: 1276 Start / End Page: - Identifier: Other: 2079-7737
CoNE: https://pure.mpg.de/cone/journals/resource/2079-7737