Data Integration & ETL
A major challenge facing the pharmaceutical industry is the explosion of data being generated, for example in the areas of high throughput screening, chemical registration and inventory, and high content screening. As a result, there is increasing demand for software infrastructure that enables effective acquisition, processing, management, and analysis of this data. The SciTegic platform helps organizations meet these challenges. The platform and its development and analysis collections can be used to implement procedures for retrieving raw data from multiple sources, combining the data, and making it available to different communities of knowledge workers. Data may originate from an in-house database or data warehouse, or it may be more specialized. The Professional client interface to the SciTegic platform can be used to build workflows (or protocols) to automate the process of extraction, transformation and loading of data (ETL) and to create data marts that provide information in a format optimized for exploratory analysis and reporting. Such protocols are typically executed as automated, scheduled procedures that notify end users of newly available content.
Related Software and Services:
- SciTegic Platform - helps you streamline the integration and analysis of vast quantities of data
through industrial-scale data
flow control and powerful mining capabilities
- SciTegic Pipeline Pilot Professional Client - enables the creation of new scientific components for personal use or for sharing with
others
- Integration Collection for the SciTegic Platform - offers numerous flexible mechanisms for seamlessly
linking external applications and databases on the SciTegic platform
- Reporting Collection for the SciTegic Platform - components for generating powerful reports and dashboards for dissemination of information
- R-Statistics Collection for the SciTegic Platform - provides a set of powerful routines based on the public domain statistics package
- Modeling Collection for the SciTegic Platform - contains a set of advanced statistical tools, for example for clustering and Bayesian analysis
- Accelrys Solutions Consulting Group - The Accelrys Solutions Consulting team can help you fully leverage you ruse of the SciTegic platform to build and use data marts.
With over 5 years experience developing life science applications, our senior solutions consultants know the challenges and solutions involved with building high performing data marts to aggregate data across work groups. Specifically, we offer expertise in database and data mart design, data modeling, semantic normalization, data federation, and relationship definition frameworks (RDF).