AI can generate unit tests in seconds. But how do you know whether those tests are actually useful?

Most teams still rely on code coverage and pass rates to evaluate their test suites. The problem is that a test can pass, increase coverage, and still provide little or no additional confidence.

We've been seeing examples where AI-generated tests:

Duplicate existing coverage

Depend on system time or GUID generation