Trusting AI: evaluation as engineering discipline
For decades, software quality has been a solved organizational problem, or at least a well-understood one. Teams write tests. Tests run automatically. When a change breaks something, the pipeline catches it before it reaches production. This discipline, built up painfully over thirty years of software engineering practice, is why modern development teams can ship multiple … Read more