MicroRNA identification using linear dimensionality reduction with explicit feature mapping
[摘要] BackgroundmicroRNAs are a class of small RNAs, about 20 nt long, which regulate cellular processes in animals and plants. Identifying microRNAs is one of the most important tasks in gene regulation studies. The main features used for identifying these tiny molecules are those in hairpin secondary structures of pre-microRNA.ResultsA new classifier is employed to identify precursor microRNAs from both pseudo hairpins and other non-coding RNAs. This classifier achieves a geometric mean Gm = 92.20% with just three features and 92.91% with seven features.ConclusionThis study shows that linear dimensionality reduction combined with explicit feature mapping, namely miLDR-EM, achieves high performance in classification of microRNAs from other sequences. Also, explicitly mapping data onto a high dimensional space could be a useful alternative to kernel-based methods for large datasets with a small number of features. Moreover, we demonstrate that microRNAs can be accurately identified by just using three properties that involve minimum free energy.
[发布日期] 2013-12-20 [发布机构]
[效力级别] [学科分类]
[关键词] Support Vector Machine;Feature Selection;Radial Basis Function;Feature Selection Method;Minimum Free Energy [时效性]