Five upstream sources feed the platform: Git commit logs (history across Elasticsearch, Spring Boot, Hadoop, Kafka, Express), issue / defect history (keyword labelling on bug tags), live DOM snapshots from Selenium and Playwright sessions, natural-language requirements (312 across three domains), and test-run telemetry from real CI executions.
Sources are intentionally heterogeneous: the platform's job is to fuse them into a single risk model and a single triage signal.