DeepCleave: A deep learning predictor for caspase and matrix metalloprotease substrates and cleavage sites

Academic Article


  • Motivation: Proteases are enzymes that cleave target substrate proteins by catalyzing the hydrolysis of peptide bonds between specific amino acids. Although protease-regulated proteolysis plays a central role in 'life and death' cellular processes, many of the corresponding substrates and their cleavage sites have not yet been identified. Accurate predictors of substrates and cleavage sites would facilitate understanding of protease functions and physiological roles. Deep learning is a promising approach for developing accurate predictors of substrate cleavage events. Results: We propose DeepCleave, the first deep learning-based predictor of protease-specific substrates and cleavage sites. DeepCleave takes protein substrate sequence data as input and employs convolutional neural networks with transfer learning to train accurate predictive models. The high predictive performance of our models stems from the use of high-quality cleavage site features extracted from the substrate sequences through the deep learning process, and from the application of transfer learning, multiple kernels, and an attention layer in the design of the deep network. Empirical tests against several related state-of-the-art methods demonstrate that DeepCleave outperforms these methods in predicting caspase and matrix metalloprotease substrate cleavage sites.
  • Published In

  • Bioinformatics (Journal)
  • Digital Object Identifier (doi)

  • Author List

  • Li F; Chen J; Leier A; Marquez-Lago T; Liu Q; Wang Y; Revote J; Smith AI; Akutsu T; Webb GI
  • Start Page

  • 1057
  • End Page

  • 1065
  • Volume

  • 36
  • Issue

  • 4
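
The abstract states that DeepCleave takes substrate sequence data as input to a convolutional network. As a rough illustration of how sequence input of this kind is often prepared, the sketch below slides a fixed-size window over every peptide bond in a sequence and one-hot encodes the residues. The function names, the flank size of 4, the `X` padding symbol, and the one-hot scheme are all assumptions made for illustration; the record above does not specify DeepCleave's actual encoding or window size.

```python
# Hypothetical sketch of sequence-window preparation for a cleavage site
# predictor. Window size, padding, and encoding are illustrative assumptions,
# not DeepCleave's published configuration.

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
AA_INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}
PAD = "X"  # assumed padding / unknown symbol, encoded as all zeros


def one_hot(residue):
    """Return a 20-dimensional one-hot vector; unknown residues map to zeros."""
    vec = [0] * len(AMINO_ACIDS)
    idx = AA_INDEX.get(residue)
    if idx is not None:
        vec[idx] = 1
    return vec


def cleavage_windows(sequence, flank=4):
    """Yield (p1_position, window) for every peptide bond in the sequence.

    p1_position is the 1-based index of the residue immediately before the
    candidate bond. Each window holds `flank` residues on either side of the
    bond, padded with PAD near the sequence ends.
    """
    padded = PAD * flank + sequence + PAD * flank
    for p1 in range(1, len(sequence)):
        # In padded coordinates the window starts at index p1.
        yield p1, padded[p1:p1 + 2 * flank]


def encode_window(window):
    """Encode a window as a list of one-hot vectors (length x 20 matrix)."""
    return [one_hot(residue) for residue in window]


if __name__ == "__main__":
    # Toy sequence containing DEVD, a motif commonly associated with caspase-3.
    for position, window in cleavage_windows("ADEVDGSA"):
        print(position, window)
```

Each encoded window is a small matrix (window length by 20) that a one-dimensional convolutional layer could consume. A real predictor would also need labeled cleavage sites and a trained model, neither of which is shown here.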