Fixes some tests that weren't aggressively catching exceptions. Fixes some actual failures that were found because of this. Tries to fix an intermittent failure with crash1_test.dart which could be a race condition.
Also, make sure that broken tests actually break the bots. And add a test to make sure that keeps happening.