An automated real-time integration and interoperability framework for bioinformatics
[摘要] BackgroundIn recent years data integration has become an everyday undertaking for life sciences researchers. Aggregating and processing data from disparate sources, whether through specific developed software or via manual processes, is a common task for scientists. However, the scope and usability of the majority of current integration tools fail to deal with the fast growing and highly dynamic nature of biomedical data.ResultsIn this work we introduce a reactive and event-driven framework that simplifies real-time data integration and interoperability. This platform facilitates otherwise difficult tasks, such as connecting heterogeneous services, indexing, linking and transferring data from distinct resources, or subscribing to notifications regarding the timeliness of dynamic data. For developers, the framework automates the deployment of integrative and interoperable bioinformatics applications, using atomic data storage for content change detection, and enabling agent-based intelligent extract, transform and load tasks.ConclusionsThis work bridges the gap between the growing number of services, accessing specific data sources or algorithms, and the growing number of users, performing simple integration tasks on a recurring basis, through a streamlined workspace available to researchers and developers alike.
[发布日期] 2015-10-13 [发布机构]
[效力级别] [学科分类]
[关键词] Data integration;Interoperability;Publish/subscribe;Integration-as-a-service;Intelligent ETL;Workflow;Cloud;Service-oriented architecture;Event-driven [时效性]