Common Data Model

A secure lakehouse that unifies SIS, LMS, CRM, web, and other sources into one trusted schema—powering analytics and agentic AI across campus.

Near real-time pipelines RBAC & audit trail API-first
At a glance
  • 🧩 Unified identities & events across systems
  • 🔌 Connectors for SIS/LMS/CRM & files (CSV, PDF, forms)
  • 📊 Clean marts for analytics, dashboards & AI agents
  • 🛡️ Encryption in transit/at rest, PII minimization
  • ⏱️ Typical rollout: 4–6 weeks

What it is

The Common Data Model (CDM) is a governance-ready lakehouse schema for higher education. It standardizes people, courses, sections, enrollments, interactions, and outcomes, and supports event-style telemetry so every downstream service—from alerts to planning to career—works off the same truth.

What it does

Ingests and reconciles data from your SIS, LMS, CRM, and web sources; resolves identities; normalizes entities and events; and publishes clean data marts and feature sets to analytics tools and Infinize’s agentic services.

Key capabilities

Unified identity graph
Deterministic + probabilistic matching to reconcile duplicate records across SIS/LMS/CRM; persistent keys for reporting and AI features.
Entity & event schema
Normalize people, programs, courses, sections, enrollments, outcomes, and timestamped events (logins, submissions, nudges, meetings).
Pipelines & data quality
Scheduled/streaming ingestion with validation rules, anomaly flags, and freshness SLAs visible to data stewards and IT.
Privacy by design
Field-level policies, PII minimization, masking in non-prod, lineage and audit logs; ready for FERPA/GDPR controls with RBAC.
Ready for analytics & AI
Feature tables and marts that power dashboards, risk signals, planning validators, universal agent actions, and career matching.
API-first & extensible
Secure REST/Graph endpoints and exports to BI tools; bring-your-own models and notebooks with governed access.

Integrations

Connect quickly with prebuilt connectors and a secure ingestion framework.

SIS (Banner, PeopleSoft, Colleague)
LMS (Canvas, Blackboard)
CRM (Salesforce, Slate)
Files & Web (CSV, PDF, Sites)
Don’t see your system? We support custom connectors via APIs, SFTP, and events.

Outcomes you can measure

Single source of truth
Reduces reconciliation time for IR/IT and eliminates report drift.
Faster insights
Near real-time marts enable timely dashboards and signals.
Agent-ready data
Universal Agent, Alerts, Planning, and Career run on the same clean model.
Lower TCO
Consolidated pipelines, reusable marts, and fewer bespoke extracts.

Security, privacy, and ethics

Security & privacy
  • Encryption at rest/in transit; VPC-isolated data plane
  • Role-based access control (RBAC) & least privilege
  • Field-level masking; secrets vault; key rotation
  • Audit trail for data access and agent actions
  • PII minimization and non-prod redaction
Responsible AI
  • Human-in-the-loop approvals for sensitive actions
  • Model cards & explainability for risk signals
  • Bias checks on features and outcomes
  • Opt-outs, purpose limitation, and data retention controls
Compliant with institutional policies and aligned to FERPA/GDPR principles.

Implementation timeline

Week 1
Kickoff & access
Environments, security review, source inventory.
Weeks 2–3
Pipelines & mapping
Connect SIS/LMS/CRM, identity resolution, DQ rules.
Week 4
Marts & features
Publish marts for dashboards and agent services.
Weeks 5–6
UAT & rollout
Validation, runbooks, training, go-live.

FAQs

It’s purpose-built for higher-ed entities/events, supports near real-time ingestion, and publishes feature tables for AI agents in addition to analytics marts.

Yes. Deploy in your preferred cloud or hybrid model. We support secure connectivity to on-prem systems and follow your data residency policies.

Versioned schema definitions, migration runbooks, column-level lineage, and automated data quality checks with alerts to stewards/IT.

Role- and attribute-based access, masking of sensitive fields, environment separation, and detailed access logs for audit and compliance.

Most institutions surface first dashboards and enable one or two agentic services within 4–6 weeks, starting with your highest-impact data sources.

Ready to put a trusted data foundation under every decision and AI agent?