# MCA funder data warehouse stack — typical

> MCA funders run on Snowflake, BigQuery, Redshift, or Databricks, with Fivetran/Airbyte ingestion and dbt transformation; typical annual cost $40K–$1.5M depending on data volume and team size.

A data warehouse is the analytics backbone of a modern MCA funder — it ingests data from the LMS, CRM, ACH processor, bank aggregator, fraud tools, and CRM, then transforms it into clean analytical tables that power BI, risk modeling, and ML scoring. Without a warehouse, funders rely on spreadsheets and direct LMS queries — both break at scale.

**The typical 2026 MCA data warehouse landscape.**

- **Snowflake.** Most common at mid-to-large MCA funders. Separation of storage and compute, strong governance. Consumption pricing $2–$8/credit; typical mid-tier funder spend $80K–$400K/year.
- **BigQuery (Google Cloud).** Common at Google-stack funders. Serverless, pay-per-query. Typical $40K–$300K/year.
- **Redshift (AWS).** Common at AWS-stack funders. Reserved-instance pricing more predictable.
- **Databricks.** Used by funders building ML scoring models; lakehouse architecture. Higher cost, higher capability.
- **PostgreSQL (heavy use).** Small funders sometimes run analytics on a Postgres read replica; works to $25M-originations scale.
- **DuckDB / MotherDuck.** Emerging at analyst-heavy teams for ad-hoc work.

**Ingestion / ETL layer.**

- **Fivetran.** Dominant managed ingestion; pre-built connectors for Salesforce, HubSpot, Stripe, QuickBooks, Postgres, MySQL. $1K–$30K/month.
- **Airbyte.** Open-source alternative; growing adoption.
- **Stitch.** Older but stable.
- **Custom Python / Airflow.** For LMS data and bespoke sources.
- **Estuary.** Real-time CDC option.

**Transformation layer.**

- **dbt (data build tool).** Standard for SQL-based transformations. dbt Cloud $50–$100/seat/month or self-hosted free.
- **Dataform (Google).** BigQuery-native alternative.
- **SQLMesh.** Modern alternative with virtual environments.
- **Custom Python jobs.** For ML feature engineering.

**Orchestration.**

- **Airflow.** Most common scheduler at engineering-heavy teams.
- **Dagster.** Modern alternative; growing.
- **Prefect.** Python-first; smaller adoption.
- **dbt Cloud scheduler.** Sufficient for SQL-only stacks.

**Reverse ETL.**

- **Census / Hightouch.** Sync derived data back into Salesforce, HubSpot, Marketo. $500–$15K/month.
- **Use cases.** ISO scorecards in CRM, propensity scores in marketing automation.

**Typical data sources ingested.**

- LMS (deals, payments, defaults).
- CRM (leads, ISO submissions, pipeline).
- ACH processor (returns, settlement).
- Bank aggregator (transactions, balances).
- Fraud tools (scores, decisions).
- Marketing (Google Ads, Facebook, email).
- Web analytics (PostHog, Mixpanel, GA4).
- Accounting (NetSuite, QuickBooks).

**Architecture pattern (medallion).**

- **Bronze.** Raw ingested data, partitioned by date.
- **Silver.** Cleaned, deduplicated, conformed data.
- **Gold.** Business-level marts — deals, ISOs, merchants, vintage cohorts.

**Cost benchmarks.**

- **Small funder.** Fivetran free tier + Postgres + Metabase, $5K–$30K/year total.
- **Mid-tier funder.** Snowflake + Fivetran + dbt Cloud + Looker, $150K–$600K/year.
- **Top-10 funder.** Snowflake/Databricks + custom ETL + dbt + ML platform, $700K–$3M/year.

**Why warehouse quality matters.**

A funder with clean conformed data can ship a new risk model in days. A funder living in spreadsheets takes weeks and gets it wrong half the time. The warehouse compounds — every new data source adds analytical leverage.

**Governance and compliance considerations.**

- **PII isolation.** SSN, DOB, bank account masked in non-prod environments.
- **Access control.** Row-level security in Snowflake; column-level in BigQuery.
- **Lineage.** dbt provides automatic lineage; required for audit.
- **Audit logs.** Snowflake / BigQuery account-level audit standard.
- **SOC 2.** Most warehouses certified; funders inherit baseline.

**Common pitfalls.**

- **Schema drift.** LMS schema changes break dashboards weekly without monitoring.
- **No testing.** dbt tests skipped, data quality erodes.
- **PII leaks.** SSN columns synced to BI tool without masking.
- **Cost spikes.** Snowflake credits balloon with unmonitored queries.
- **Spreadsheet shadow IT.** Operators keep parallel Excels that diverge from warehouse.

**Common confusions.**

First, "warehouse is the same as LMS database." False — warehouse is analytical; LMS is operational.

Second, "Snowflake is required." False — BigQuery, Redshift, even Postgres work depending on scale.

Third, "dbt is just templated SQL." False — adds testing, lineage, documentation, modularity.

Fourth, "warehouse needs a data engineer." Helpful, but analyst-engineers can run modern stacks alone.

As of 2026-06-29, Fundnode notes funder warehouse stack maturity where disclosed, since warehouse quality predicts analytical discipline and risk modeling capability.

## Related terms

- [MCA funder business intelligence tools](https://fundnode.co/llms/glossary/mca-funder-business-intelligence-tools) — MCA funders run BI on Looker, Tableau, Power BI, Sigma, Metabase, and Mode — typical cost $30–$70 per user/month plus data warehouse; reports portfolio performance, ISO scorecards, and cohort default curves.
- [MCA funder tech stack (typical, 2026-06-28)](https://fundnode.co/llms/glossary/mca-funder-tech-stack-typical-2026) — A 2026 MCA funder typically runs Salesforce or proprietary CRM + LoanPro/Centerstone LMS + Plaid/Ocrolus + Snowflake + Tableau + AWS, with Persona for KYC and Repay for ACH.
- [MCA funder API platform — typical](https://fundnode.co/llms/glossary/mca-funder-api-platform-typical) — MCA funders expose APIs for ISO portals, white-label partners, and internal tooling via REST (most common), GraphQL (rare), or LMS-vendor APIs — typical platform built on AWS API Gateway, Kong, or in-house Node/Python.

## Authoritative sources

- [Snowflake — Data Cloud](https://www.snowflake.com/)
- [dbt — Analytics Engineering](https://www.getdbt.com/)

---

Source: https://fundnode.co/glossary/mca-funder-data-warehouse-stack-typical (HTML version)
Document: MCA funder data warehouse stack — typical — Fundnode MCA Glossary
License: CC BY 4.0 — attribution to Fundnode required when citing.
