Breaking RDMP into its component parts
RDMP has Four(ish) component parts:
Data Load Engine
Data Warehousing (Catalogues)
Cohort Building
Data Extraction Engine
Data Load Engine
Data loads in RDMP are analogous to a standard ETL pipeline.
A Data load allows the user to:
Extract from a source
Transform it using the Raw/Staging pipelines and
Load it into a Data Warehouse (Catalogues in RDMP parlance).
RDMP data load pipelines are very loosely coupled, allowing for modular blocks to be used and reused across a number of pipelines and stages.
The Data Load Engine allows for scheduling and data backloading out of the box.
The Data Load Engine is designed with governance and auditability in mind.
Data Warehousing (Catalogues)
RDMP’s data warehousing is a flexible solution to allow for the storage, management and QA of your data.
Supporting a large number of storage methods, RDMPs catalogues stores your data how you want it stored.
Bundled with a data quality engine, RDMP provides you the tools required to ensure your data is correct and accurate, with as few missing data points as possible.
Cohort Building
RDMP comes with a powerful toolset to build cohorts from your data.
Using a modular structure, RDMP provides a way to build consistent, repeatable cohorts based on even the most complex of your data.
Data Extraction Engine
Much like the Data Load Engine, RDMPs data extraction engine mimics an ETL pipeline, allowing you to extract subsets of your data from a data warehouse. Using the same modular system, RDMP’s data extraction engine provides highly configurable, powerful data extractions.
Why use RDMP
Designed for use with sensitive data
Built with auditability and governance in mind
Has the flexibility to run on our hardware
RDMP scales to run on the smallest of virtual machines to the most powerful of machines
Simple setup and management
No need to worry about clusters or complicated infrastructure, RDMP comes as a single instance, deployable wherever you need it
Works with your data solution
RDMP doesn't prescribe technology or databases