This change to the integration tests should properly isolate multiple runs of the same job. Previously, some jobs would fail if you attempted to re-run them via the UI, because the cache entries would be re-used from the previous run.
- Remove MacOS since the agents are current flaky - Add Windows to a few more test pipelines
Fixes #88