Have a structured way of gathering test results, rather than the existing ad hoc approach of just printing stuff. The details are still pretty primitive, but there's room to grow.
Do a tree reduction in addition to the existing decoupled look-back, to explore the tradeoff between performance and compatibility.