Rud Merriam said:
> The test output from the Lisp version (Push2, as I recall) is insufficient
for really good testing since it only reports successful operations. For
instance, it doesn't test for missing stack values: "1 INTEGER.+".
That's a reason for a format that can include detailed information about the interpreter state after each execution step and other events.