System Architecture
GoodData is engineered for high-frequency event ingestion and real-time predictive modeling. We prioritize low-latency writes and asynchronous processing to ensure your application performance is never impacted.
The High-Level Flow
Data moves through a high-performance pipeline designed for speed and intelligence.
Components Explained
Client Layer
The Intelligence Engine is a lightweight JavaScript library (15kb) that runs in your application. It collects behavioral signals, detects user intent, and sends events asynchronously to avoid blocking your app.
Ingestion Layer
The Ingestion API is built on Node.js/Express and handles high-volume event streams. Events are buffered and written to persistent storage with deduplication and validation.
Persistence Layer
PostgreSQL stores raw events for compliance and debugging. TimescaleDB handles time-series aggregations and serves queries for dashboards and ML pipelines.
Intelligence Layer
Python-based ML workers run asynchronously, processing event batches to generate predictions, anomaly detection, and behavioral insights that feed back into real-time recommendations.
Key Design Principles
- Low Latency: Events processed in milliseconds
- High Throughput: Millions of events per second
- Data Privacy: Raw events isolated from analytics
- Scalability: Horizontal scaling of workers
- Real-time: Streaming predictions with sub-second latency
Learn more in our Getting Started guide.