Looking for suggestions on improving parallelism #3164

bradwilson · 2025-02-10T04:48:57Z

bradwilson
Feb 10, 2025
Maintainer

We have two open issues right now related to how tests run in parallel:

As I work through what the options for parallelism should be, I'm thinking maybe trying to set up a strict list of choices is probably the wrong strategy, so here's what I'm considering.

For starters, we should state where we are today: we have two modes of parallelization, with some modifiers.

The first is "nothing runs in parallel"; that one is easy to understand.
The second is "tests within a single test collection do not run in parallel against each other, but tests from different test collections can run in parallel against each other". Further, we have a flag together on test collections which says "this test collection cannot be run in parallel against other test collections", so in any given project, there are 0 or more test collections opting out of parallelism entirely.

Additionally, we the two issues linked again are requests for:

Run everything in parallel, always. (I assume there would need to be some opt out here as well, like we have with test collections. The mechanism here is TBD.)
If tests don't have any shared context (no fixtures), they're eligible for being run in parallel against any other test, even in the same test class; if there are fixtures, then the old rules apply.

Now that we're at (at least) four different ways of defining how to run things in parallel, it feels like the old way of specifying parallelism isn't sufficient. So I think I want to change how we specify intra-assembly parallelism in configuration (remove parallelizeTestCollections) and command line switches (rework -parallel), and instead replace it with a user-customizable "parallelism sorter".

I'm not 100% sure whether this design is where I'll land, but it's a starting point for discussion.

The component would implement some interface, and be registered at the assembly-level. It would me given a list of all the test cases, and it would sort them into groups which serve two functions: (a) zero or one "non-parallel" group, which means any test case in that group is not run in parallel against any other test (this is to accommodate the "non-parallel" test collections today, as well as being extended to perhaps the test class and/or test method level); (b) zero or more "parallel" groups which contain tests which cannot be run in parallel against each other within the same group, but can be run in parallel against any test in any other "parallel" group.

For the purposes of illustration below, let's assume that there are ways to opt out of parallelism on a per-test-collection basis (that exists today, via CollectionDefinition.DisableParallelization = true) as well as perhaps on a per-test-class and per-test-method basis. We'll just call those "non-parallel tests" for simplicity.

The default behaviors would work like this:

Nothing runs in parallel

Everything is placed into the "non-parallel" group. No "parallel" groups exist.

Test collections run in parallel against each other

The "non-parallel tests" go into the "non-parallel" group. Create one "parallel" group per remaining test collection, containing all the tests in that collection.

The new behaviors could be accommodated with:

Everything runs in parallel

The "non-parallel tests" go into the "non-parallel" group. Create one "parallel" group per remaining test.

Disable parallelism only when there is shared context

The "non-parallel tests" go into the "non-parallel" group. For every test that has some shared fixture instance (excepting assembly-level fixtures), group them together into a "parallel" group, per shared fixture instance. For every test that does not have any fixtures, create one "parallel" group per test.

I think the "shared fixture instance" is a bit complicated, so here's my thinking about this.

We share instances of fixture data from ICollectionFixture<> and IClassFixture<>. That means in this definition, anybody in a test collection with any instances of ICollectionFixture<> attached to it end up in a group together (because they're sharing the collection fixture instance). Then, any test classes which have IClassFixture<> (either directly or via a collection definition) but no ICollectionFixture<> would cause all the tests in that class to be in a group together.

It's important to remember that this is about shared fixture instances. That means, in the scenario below, we'd end up with two "parallel" groups here, not one, because despite sharing a fixture type, the don't share a fixture instance:
public class TestClass1 : IClassFixture<MyFixture> { }

public class TestClass2 : IClassFixture<MyFixture> { }
Ditto for multiple test collections which both use the same fixture type via ICollectionFixture<>, since they'll be sharing the type but not the instance.

That's the sum total of my thoughts. Removing this as a configuration item with built-in behavior, and instead replacing it with a user-customizable, compile-time (assembly-level) choice, should allow us to be able to add new rule sets later without mucking around with configuration options. It should also allow users to do unusual things that seem right for them, like my (currently over complicated) sample of parallelizing based on namespace, which would become fairly trivial in the new design.

So here are my open questions:

What do you think of this design?
a. Is the design reasonable, or too complex? Do you have an alternate design that's cleaner/simpler while still allowing for all the requirements?
b. Do you have parallelization requirements that are different than the four options shown here? Can you accomplish those goals with the generalized design here?
What do you think of the idea of removing the intra-assembly parallelism options (configuration files, command line switches, etc.)? What about keeping the inter-assembly parallelism options (for the multi-assembly runners like our first party Console and MSBuild runners)? Should we also remove those configuration file options and leave this decision solely to the runner based on command line options?

PureKrome · 2025-02-10T07:49:19Z

PureKrome
Feb 10, 2025

🥰Thank you @bradwilson for starting/hosting this conversation. Awesome!

Next, I've silently never really liked the 2x defaults. I "get it" but just never really liked it.

I've always wanted to have every test run in parallel. Not just every test class run in parallel.

I'm a strong supporter of test isolation so I really try to avoid shared context as much as possible. That said I have been doing Assembly/Project level shared context -> create test containers once at start of test run. Then have my tests all want to run in parallel after that against the shared context.

Sometimes I've felt like I wanted Test Collections where the 'collection' does a single setup (eg. connect to a db or create a specific test container) but then all the tests after that are parallel.

1a: I feel like this design is reasonable. I would be generally sitting in the "Everything runs in parallel" option. But as mentioned above, I use Test Containers which I would run at the start of the run, once. This would then be a SHARED CONTEXT which means I now loose parallelism, so I'm not done/out :(

1b: As mentioned above - yes I do. I currently don't have a "clean" work around. Here's my opinionated scenario: Isolated DB Tests with a single db per test.

Run start: Test Container creates a DB (whatever flavor-flav ⏱️ u like)
Each test now runs in proper parallel.
- Each tests would create a new DB tenant in the single DB Instance. Each tenant name is unique to avoid clashing
- Each test would decide how the DB schema and data would look like. Simple example would be all tables/views/sp (urgh) are generated and data is seeded.
All Tests now run.
💸

I'm doing the above in xUnit v3 but it's the default class-level parallel.

Because of the first step which is the shared context, then option 4/Disable parallelism only when there is shared context would really hurt me, if this setting was auto detected/set.

I'm unable to answer this because I don't think I understand it correctly. (It's a "me" issue). FWIW, I run my tests in VS using the default Test Runner thingy (and trying to use the NEW MS test Plaform thingy). I then also run tests in CI/CD so this is all CLI. I always prefer setting stuff via CLI options, not config files. I guess if this means: "How do i configure the VS Test runner", then?

--
Thank you again @bradwilson 🫶

0 replies

bradwilson · 2025-02-16T18:53:19Z

bradwilson
Feb 16, 2025
Maintainer Author

Looking at how this gets implemented, I am wondering about our current messages and what should be done.

Let's assume we implement this parallelism sorting system, and let's assume someone has chosen "everything runs in parallel". Each parallel "group" is a single test case. Let's assume the developer has three tests in one test class, and four tests in another test class, and both those test classes are in the same test collection.

We have starting/finished messages for each layer of execution:

ITestAssemblyStarting / ITestAssemblyFinished
ITestCollectionStarting / ITestCollectionFinished
ITestClassStarting / ITestClassFinished
ITestMethodStarting / ITestMethodFinished
ITestCaseStarting / ITestCaseFinished
ITestStarting / ITestFinished

The assembly level message is easy to ensure we get one pair. 😄 However, all the way down this stack, in the existing model we guarantee only one pair per element is sent, because of the way we've defined parallelism. For example, all tests in a collection are run sequentially against each other, so by virtue of that grouping, we can guarantee that there is one singular place from which to send the singular pair of test collection starting & finished messages for a given test collection.

In a model where everything can be parallelized, given my example above, I have 7 test cases in the test collection, but they're all being run in parallel against each other. I think this leaves at least three options for dealing with the message pairs:

~~Send the pair 7x, because the collection is being started and finished 7x?~~
Send the pair 1x, but this requires you pre-sort all the test cases and group them by test collection so you know when the first one starts and the last one ends?
Don't send these messages at all any more?

I'm concerned that the most "correct" way to do this is by the by-far most computationally and memory expensive version (don't forget, everything I'm talking about here applies to test classes and test methods as well). Not only do we need to compute those groupings and keep them for the duration of the run, we also need to add a potential parallelism bottleneck since we have to lock around the collections so we can accurately keep track of first start vs. last finish, which feels like a parallelism bottleneck of sorts (and the faster the tests are at executing, the larger a %age of the execution time is spent in this bookkeeping).

Part of me wonders whether these messages are providing value to any runner out there. We certainly don't use them in any of our runners; the only thing people tend to care about is actually test starting/test result/test finished. So part of me wonders if I should just remove these messages entirely (at least the test collection/test class/test method versions).

Edit: I removed the 7x option due to my reply below

1 reply

bradwilson Feb 16, 2025
Maintainer Author

Actually, the more I think about it, it seems impossible to do anything other than the heavy handed sorting and tracking.

TestCollectionRunner, which today is responsible for the messages, also gives us a place to do things like creating and cleaning up the collection fixtures. That still has to be tracked and serialized appropriately, so that the first test case that wants to start gets all the collection fixture instances initialized, and the last one that stops gets all the collection fixture instances cleaned up.

Even if we decided to stop sending the messages, we'd still have all this other bookkeeping associated with the test context to track.

Piedone · 2025-02-27T16:21:16Z

Piedone
Feb 27, 2025

I'd just add that we have the parallelization requirement of "parallelize everything in a given assembly, but don't run tests of different assemblies in parallel". So, supporting a pluggable model of determining which tests to parallelize, with implementation for a couple of common scenarios, looks like a good approach.

0 replies

PureKrome · 2025-05-29T00:27:55Z

PureKrome
May 29, 2025

👋🏻 Hi @bradwilson - just touching base. Is there anything we can help with here, to try and get some momentum with this?

I'm firmly in the camp "Run everything in parallel, always" with ways to opt-out -per method-. Like a trait or attribute or something.

Are you just trying to find some time to make some decisions here?

2 replies

bradwilson May 30, 2025
Maintainer Author

This has not yet bubbled up to the top of the stack of "very large" work items. I will generally only do one of those at a time. As such, I have no estimate for when this might get implemented.

PureKrome May 30, 2025

No probs. Appreciate the update 👍🏻

Looking for suggestions on improving parallelism #3164

Uh oh!

bradwilson Feb 10, 2025 Maintainer

Nothing runs in parallel

Test collections run in parallel against each other

Everything runs in parallel

Disable parallelism only when there is shared context

Replies: 4 comments · 3 replies

Uh oh!

Uh oh!

PureKrome Feb 10, 2025

Uh oh!

Uh oh!

bradwilson Feb 16, 2025 Maintainer Author

Uh oh!

bradwilson Feb 16, 2025 Maintainer Author

Uh oh!

Piedone Feb 27, 2025

Uh oh!

PureKrome May 29, 2025

Uh oh!

bradwilson May 30, 2025 Maintainer Author

Uh oh!

PureKrome May 30, 2025

bradwilson
Feb 10, 2025
Maintainer

Replies: 4 comments 3 replies

PureKrome
Feb 10, 2025

bradwilson
Feb 16, 2025
Maintainer Author

bradwilson Feb 16, 2025
Maintainer Author

Piedone
Feb 27, 2025

PureKrome
May 29, 2025

bradwilson May 30, 2025
Maintainer Author