JetBrains Research is a private enterprise created to unite scientific projects that make a real difference and strive to improve the current state of science and technology. With the support of JetBrains, researchers and teams can focus on the actual work instead of seeking grants or dealing with other management issues.
3rd International Workshop on Refactoring (IWoR),
16th International Conference on Mining Software Repositories (MSR),
Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies - BIOINFORMATICS,
We propose a way to combine formal grammars and artificial neural networks for biological sequence processing. Formal grammars encode the secondary structure of the sequence, and neural networks deal with mutations and noise. In contrast to the classical approach, in which probabilistic grammars are used for secondary structure modeling, we propose to use arbitrary (not probabilistic) grammars, which simplifies grammar creation. Instead of modeling the structure of the whole sequence, we create a grammar that describes only features of the secondary structure. We then use matrix-based parsing to extract features: the fact that some substring can be derived from some nonterminal is a feature. Finally, we use a dense neural network to process the features.
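The feature-extraction step can be illustrated with a minimal sketch. It is not the authors' implementation: the toy hairpin grammar, the rule tables, and the names `matrix_parse` and `extract_features` are all assumptions made for illustration. The sketch assumes a grammar in Chomsky normal form and computes, for each nonterminal, a Boolean matrix whose entry (i, j) is true when the substring `word[i:j]` can be derived from that nonterminal. The upper triangle of the matrix for the start nonterminal is then flattened into a vector of the kind a dense network could take as input.

```python
import numpy as np

# Hypothetical toy grammar in Chomsky normal form: S derives odd-length
# hairpins made of complementary pairs (a-u, c-g) around one nucleotide.
TERMINAL_RULES = {
    "S": {"a", "c", "g", "u"},  # one-nucleotide loop
    "A": {"a"}, "C": {"c"}, "G": {"g"}, "U": {"u"},
}
BINARY_RULES = {
    "S": [("A", "S1"), ("U", "S2"), ("C", "S3"), ("G", "S4")],
    "S1": [("S", "U")],  # a S u
    "S2": [("S", "A")],  # u S a
    "S3": [("S", "G")],  # c S g
    "S4": [("S", "C")],  # g S c
}


def matrix_parse(word, terminal_rules, binary_rules):
    """Return {nonterminal: Boolean matrix T}, where T[i, j] is True
    iff word[i:j] can be derived from that nonterminal."""
    n = len(word)
    nonterminals = set(terminal_rules) | set(binary_rules)
    for bodies in binary_rules.values():
        for left, right in bodies:
            nonterminals.update((left, right))
    T = {nt: np.zeros((n + 1, n + 1), dtype=bool) for nt in nonterminals}

    # Terminal rules fill the cells for substrings of length one.
    for nt, terminals in terminal_rules.items():
        for i, ch in enumerate(word):
            if ch in terminals:
                T[nt][i, i + 1] = True

    # Apply every rule head -> left right as a Boolean matrix product
    # until no matrix changes (a fixed point is reached).
    changed = True
    while changed:
        changed = False
        for head, bodies in binary_rules.items():
            for left, right in bodies:
                product = (T[left].astype(np.int64) @ T[right].astype(np.int64)) > 0
                updated = T[head] | product
                if (updated != T[head]).any():
                    T[head] = updated
                    changed = True
    return T


def extract_features(word, start="S"):
    """Flatten the upper triangle of the start nonterminal's matrix
    into a 0/1 feature vector, one entry per non-empty substring."""
    T = matrix_parse(word, TERMINAL_RULES, BINARY_RULES)
    rows, cols = np.triu_indices(len(word) + 1, k=1)
    return T[start][rows, cols].astype(np.float32)


features = extract_features("gcagc")
```

For a sequence of length n the vector has n(n + 1)/2 entries. Iterating to a fixed point keeps the sketch short; a practical implementation would more likely process substrings in order of increasing length or use sparse Boolean matrices.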