Research

Queen Mary University of London (QMUL) · MSc (in progress)

AI-Driven Test Case Generation: A Hybrid Approach Combining Rule-Based Systems with Large Language Models

This thesis investigates a hybrid approach to automated test case generation that combines deterministic rule-based scaffolding with Large Language Model correction. Evaluated on a production CAD application (55 user stories, 870 test cases), the system achieved 92% time reduction, $0.002 per test case, and 94.4% acceptance criteria coverage.

Key Findings

  • 92% reduction in test creation time vs fully manual methods
  • First-pass quality rate of 72.9% (95% CI: 58.4%--84.3%)
  • Dominant failure mode: non-deterministic language in error-handling tests (38.9%)
  • Hybrid approach provides structural guarantees that pure LLM methods lack
  • LLM cost is negligible ($0.48 for 207 test cases) -- human review is the real cost

Research Questions

  • 01RQ1: Does hybrid generation reduce time vs manual?
  • 02RQ2: What is the first-pass structural quality rate?
  • 03RQ3: What are the dominant failure modes?
  • 04RQ4: How does hybrid compare to pure LLM approaches?