Simulation configuration

The current model sample size is 4000. However, since we use a Markov chain sampler, the samples are correlated: the effective sample size varies between parameters, within the range (1111, 1.0252 × 10^4).

Posterior model performance

The table below lists the recorded matches, each with the model's expected probability of the observed result. Fairness rates how even the matchup is, players included, on a scale from 0 to 1.
match name	result probability	fairness
RACE2 Round 1, Game 1: cstick [Demonology]/Anarchy/Law vs. EricF [Past]/Blood/Peace, won by EricF	0.74	0.53
RACE2 Round 1, Game 2: zhavier [Fire]/Growth/Necromancy vs. YoungBuck [Past]/Anarchy/Peace, won by zhavier	0.76	0.49
RACE2 Round 2, Game 1: EricF [Past]/Blood/Peace vs. zhavier [Fire]/Growth/Necromancy, won by EricF	0.63	0.73
CAFS16 Round 1, Game 3: cstick [Demonology]/Anarchy/Law vs. Drakona [Blood/Fire]/Finesse, won by cstick	0.80	0.41
CAFS16 Round 1, Game 1: PiHalbe [Demonology/Disease/Necromancy] vs. Barrelfish [Necromancy]/Blood/Growth, won by Barrelfish	0.52	0.96
CAFS16 Round 1, Game 7: Zejety [Finesse]/Discipline/Strength vs. Lettucemode [Balance/Feral/Growth], won by Zejety	0.79	0.41
CAFS16 Round 1, Game 8: Anemone [Demonology/Disease/Necromancy] vs. jasonwocky [Future/Past/Present], won by Anemone	0.81	0.39
CAFS16 Round 1, Game 5: bravo840 [Necromancy]/Blood/Ninjitsu vs. FrozenStorm [Past]/Anarchy/Peace, won by FrozenStorm	0.82	0.37
CAFS16 Round 1, Game 4: Bryce_The_Rice [Feral]/Strength/Truth vs. EricF [Past]/Anarchy/Peace, won by EricF	0.85	0.30
CAFS16 Round 1, Game 9: Pooh_Bear [Demonology/Disease/Necromancy] vs. PointyFinger [Anarchy/Blood/Fire], won by Pooh_Bear	0.64	0.72

Showing 1 to 10 of 979 entries
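
The page doesn't define fairness, but the listed values are consistent with fairness = 1 − 2|p − 0.5|, where p is the predicted probability: an even matchup scores 1 and a certain result scores 0. The Python sketch below is an inference from the table, not the model's confirmed definition, and the function name `fairness` is made up for illustration. It compares the formula with the ten rows shown above, allowing for the fact that the listed probabilities are already rounded.

```python
def fairness(p):
    """Assumed definition: 1 for a 50/50 result, falling linearly to 0 at p = 0 or 1."""
    return 1 - 2 * abs(p - 0.5)

# (result probability, listed fairness) for the ten matches shown above.
rows = [
    (0.74, 0.53), (0.76, 0.49), (0.63, 0.73), (0.80, 0.41), (0.52, 0.96),
    (0.79, 0.41), (0.81, 0.39), (0.82, 0.37), (0.85, 0.30), (0.64, 0.72),
]

# The table's probabilities are rounded to two decimals, so small gaps are expected.
max_gap = max(abs(fairness(p) - listed) for p, listed in rows)
print(f"largest gap between formula and table: {max_gap:.3f}")
```

Because the formula is symmetric around 0.5, it gives the same fairness whether p is the probability of the observed result or of the opposite one.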


If the model predicted each match by picking the more likely winner, it would get 789 of the 979 matches above right (80.6%). Note that these are the same matches that were used to fit it. Since the model gives a probability of each player winning, we can use proper scoring rules instead (the closer to zero, the better):

model	log score	Brier score
predict every match as 5-5	0.693	0.250
current model	0.457	0.146
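
As a rough illustration of the two scoring rules, the Python sketch below assumes that the log score is the mean negative natural log of the probability given to the observed result, and that the Brier score is the mean of (1 − p)². Both conventions are assumptions, chosen because they reproduce the 0.693 and 0.250 in the 5-5 row; the function names are made up for illustration.

```python
import math

def log_score(probs_of_observed):
    """Mean negative log probability assigned to what actually happened."""
    return -sum(math.log(p) for p in probs_of_observed) / len(probs_of_observed)

def brier_score(probs_of_observed):
    """Mean squared gap between the assigned probability and certainty (1)."""
    return sum((1 - p) ** 2 for p in probs_of_observed) / len(probs_of_observed)

# Baseline: every match called as 5-5, i.e. probability 0.5 for the observed result.
baseline = [0.5] * 10
print(round(log_score(baseline), 3), round(brier_score(baseline), 3))  # 0.693 0.25

# The ten result probabilities from the first page of the match table.
# This is only a ten-match subset, so it will not equal the full-model row.
shown = [0.74, 0.76, 0.63, 0.80, 0.52, 0.79, 0.81, 0.82, 0.85, 0.64]
print(log_score(shown), brier_score(shown))
```

Both scores punish confident wrong predictions more heavily than a simple hit rate does, which is why they are preferred for a model that outputs probabilities.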

Player skill

Player skills are given as their (additive) effect on the player's log-odds of winning a match. Skill is currently assumed not to change over time, so the skill estimates for long-absent players are more narrowly distributed than our real certainty about their current skill would justify. The assumption also favours players who were veterans before the earliest recorded match, because the period in which they learned the ropes is not included in their match records.

Here are the same results, narrowed down to players who have played in the last year (i.e. have finished a recorded match on 2021-09-14 or later):

player	mean skill	prob. best (n = 4000)	prob. best active (n = 4000)
Akiata	-0.382	0
Alhazard	0.132	0.0015
Andreas	-0.382	0.00025
Anemone	1.127	0.05025
ARMed_PIrate	-0.916	0
bansa	2.079	0.29675	0.70775
Barrelfish	-0.922	0
Bob199	-0.182	0
bolyarich	1.386	0.022
bravo840	-0.796	0

Showing 1 to 10 of 75 entries


Opposed component effects

Decks are treated in opposing pairs of components: P1 starter versus P2 starter, P1 starter versus each P2 spec, and so on. Each match has 16 such pairs. Each pair’s effect is given as its additive effect on the log-odds of a player 1 victory. The pairs’ effects are added together to give the overall matchup between the decks, before accounting for player skill levels.

Note that these pair effects are not direct appraisals of how the components fare against each other. For example, the Green vs. Black effect doesn’t assess how those two starter decks match up against each other; it assesses how decks using those starter decks tend to match up against each other. Similarly, Blood vs. Future doesn’t directly assess how those two specs compete at, say, Tech II, because I don’t record tech building choices. Instead, it shows how P1 decks including Blood tend to fare against P2 decks including Future. Note that this also ignores interactions between different pairs completely.

To examine the matchup between two particular decks, add their components in the relevant Deck components column. The overall matchup is then given below the table, as both the log-odds and the probability of a player 1 victory. Individual pair effects are given in the displayed table rows.

Currently I’ve not added the players in the same table to account for skill effects. In the meantime, since player skill tends to have a larger effect than the deck matchup, don’t compare the deck matchups to your own match outcomes too strictly, unless you can manually add the effects from the player table (remember to subtract the P2 effect, not add it).
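
The arithmetic described above can be sketched in Python as follows. This is a minimal illustration, not the page's own code: the function names are made up, the four Black-versus-component effects are copied from the first rows of the table below, and the other twelve pair effects are placeholder zeros because they are not shown on this page. Plugging in posterior mean skills is also only an approximation to what the full model would give.

```python
import math
from itertools import product

def logistic(log_odds):
    """Convert log-odds to a probability."""
    return 1 / (1 + math.exp(-log_odds))

def deck_log_odds(p1_deck, p2_deck, pair_effects):
    """Sum the effects of every P1 component versus P2 component pair."""
    return sum(pair_effects[(a, b)] for a, b in product(p1_deck, p2_deck))

def p1_win_probability(deck_total, p1_skill=0.0, p2_skill=0.0):
    """Add P1's skill, subtract P2's skill, then convert to a probability."""
    return logistic(deck_total + p1_skill - p2_skill)

# A deck is a starter plus three specs, so two decks give 4 x 4 = 16 pairs.
p1_deck = ("Black", "Anarchy", "Blood", "Demonology")
p2_deck = ("Black", "Anarchy", "Blood", "Demonology")

# PLACEHOLDER zeros for every pair, then the four values listed in the table.
pair_effects = {pair: 0.0 for pair in product(p1_deck, p2_deck)}
pair_effects.update({
    ("Black", "Black"): 0.002,
    ("Black", "Anarchy"): -0.024,
    ("Black", "Blood"): 0.031,
    ("Black", "Demonology"): 0.072,
})

total = deck_log_odds(p1_deck, p2_deck, pair_effects)
print(f"decks only: {p1_win_probability(total):.3f}")

# Mean skills from the player table: bansa (2.079) as P1, bolyarich (1.386) as P2.
print(f"with players: {p1_win_probability(total, 2.079, 1.386):.3f}")
```

The same conversion links the two numbers in the table footer: a total of 3.95 log-odds corresponds to a player 1 win probability of about 98.1%.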


Deck components		Mean Player 1 win log-odds effect
Player 1	Player 2	Mean Player 1 win log-odds effect
Black	Black	0.002
Black	Neutral	0.036
Black	Purple	-0.116
Black	Purplev2	0.007
Black	Red	0.231
Black	White	0.234
Black	Anarchy	-0.024
Black	AnarchyP22	-0.013
Black	Anarchyv2	0.008
Black	Balance	0.015
Black	Bashing	0.118
Black	Blood	0.031
Black	DemonBash	-0.009
Black	Demonology	0.072
Black	DemonologyP22	0.020
Black	Demonologyv2	0.010
Player 1	Player 2	Total: 3.95; P1 win prob: 98.1%

Showing 1 to 16 of 1,369 entries


Monocolour matchups

Since we’re most interested in whether monocolour decks are reasonably balanced, here are matchup results for the monocolour decks. The three black vertical lines in each plot facet show the matchup quartiles.

Results are given first for the original cards, then for the forum standard cards v2.1.


P1	P2	P1 win probability	matchup	fairness
MonoBlack	MonoBlack	0.537	5.4-4.6	0.93
MonoBlue	MonoBlack	0.340	3.4-6.6	0.68
MonoGreen	MonoBlack	0.225	2.3-7.7	0.45
MonoPurple	MonoBlack	0.361	3.6-6.4	0.72
MonoRed	MonoBlack	0.580	5.8-4.2	0.84
MonoWhite	MonoBlack	0.600	6.0-4.0	0.80
MonoBlack	MonoBlue	0.591	5.9-4.1	0.82
MonoBlue	MonoBlue	0.504	5.0-5.0	0.99
MonoGreen	MonoBlue	0.828	8.3-1.7	0.34
MonoPurple	MonoBlue	0.288	2.9-7.1	0.58

Showing 1 to 10 of 36 entries


We can also average over a deck’s performance when going first and when going second, to see how the general matchups look:


P1	P2	P1 win probability	matchup	fairness
MonoBlack	MonoBlack	0.500	5.0-5.0	1.00
MonoBlack	MonoBlue	0.626	6.3-3.7	0.75
MonoBlack	MonoGreen	0.726	7.3-2.7	0.55
MonoBlack	MonoPurple	0.613	6.1-3.9	0.77
MonoBlack	MonoRed	0.511	5.1-4.9	0.98
MonoBlack	MonoWhite	0.605	6.1-3.9	0.79
MonoBlue	MonoBlue	0.500	5.0-5.0	1.00
MonoBlue	MonoGreen	0.443	4.4-5.6	0.89
MonoBlue	MonoPurple	0.619	6.2-3.8	0.76
MonoBlue	MonoRed	0.397	4.0-6.0	0.79

Showing 1 to 10 of 21 entries


Finally, we can instead average the first player’s win probability over both ways of seating the two decks, which shows how much a matchup depends on who goes first:


P1	P2	P1 win probability	matchup	fairness
MonoBlack	MonoBlack	0.537	5.4-4.6	0.93
MonoBlack	MonoBlue	0.466	4.7-5.3	0.93
MonoBlack	MonoGreen	0.451	4.5-5.5	0.90
MonoBlack	MonoPurple	0.474	4.7-5.3	0.95
MonoBlack	MonoRed	0.591	5.9-4.1	0.82
MonoBlack	MonoWhite	0.705	7.1-2.9	0.59
MonoBlue	MonoBlue	0.504	5.0-5.0	0.99
MonoBlue	MonoGreen	0.772	7.7-2.3	0.46
MonoBlue	MonoPurple	0.407	4.1-5.9	0.81
MonoBlue	MonoRed	0.568	5.7-4.3	0.86

Showing 1 to 10 of 21 entries
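
The two kinds of averaging can be checked by hand from the first monocolour table. The Python sketch below assumes each average is a plain mean of two P1 win probabilities. The model may well average over posterior samples instead, but the simple arithmetic reproduces the MonoBlack versus MonoBlue entries to about three decimals. The function names are made up for illustration.

```python
def deck_average(p_a_first, p_b_first):
    """Deck A's win probability against deck B, averaged over going first and second.

    p_a_first: P1 win probability with A as P1 and B as P2.
    p_b_first: P1 win probability with B as P1 and A as P2.
    """
    return (p_a_first + (1 - p_b_first)) / 2

def first_player_average(p_a_first, p_b_first):
    """Win probability of whoever goes first, averaged over both seatings."""
    return (p_a_first + p_b_first) / 2

# From the first table: MonoBlack (P1) vs MonoBlue is 0.591,
# and MonoBlue (P1) vs MonoBlack is 0.340.
black_first, blue_first = 0.591, 0.340

print(deck_average(black_first, blue_first))          # table lists 0.626
print(first_player_average(black_first, blue_first))  # table lists 0.466
```

A mirror match always averages to 0.500 in the first sense, while in the second sense it simply keeps its P1 win probability, as with MonoBlack versus MonoBlack at 0.537.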


Forum standard cards v2.1

P1	P2	P1 win probability	matchup	fairness
MonoBlackv2	MonoBlackv2	0.452	4.5-5.5	0.90
MonoBlue	MonoBlackv2	0.513	5.1-4.9	0.97
MonoGreenv2	MonoBlackv2	0.465	4.6-5.4	0.93
MonoPurplev2	MonoBlackv2	0.487	4.9-5.1	0.97
MonoRedv2	MonoBlackv2	0.562	5.6-4.4	0.88
MonoWhitev2	MonoBlackv2	0.509	5.1-4.9	0.98
MonoBlackv2	MonoBlue	0.505	5.1-4.9	0.99
MonoBlue	MonoBlue	0.504	5.0-5.0	0.99
MonoGreenv2	MonoBlue	0.779	7.8-2.2	0.44
MonoPurplev2	MonoBlue	0.349	3.5-6.5	0.70

Showing 1 to 10 of 36 entries


We can also average over a deck’s performance when going first and when going second, to see how the general matchups look:


P1	P2	P1 win probability	matchup	fairness
MonoBlackv2	MonoBlackv2	0.500	5.0-5.0	1.00
MonoBlackv2	MonoBlue	0.496	5.0-5.0	0.99
MonoBlackv2	MonoGreenv2	0.519	5.2-4.8	0.96
MonoBlackv2	MonoPurplev2	0.548	5.5-4.5	0.90
MonoBlackv2	MonoRedv2	0.445	4.4-5.6	0.89
MonoBlackv2	MonoWhitev2	0.533	5.3-4.7	0.93
MonoBlue	MonoBlue	0.500	5.0-5.0	1.00
MonoBlue	MonoGreenv2	0.447	4.5-5.5	0.89
MonoBlue	MonoPurplev2	0.599	6.0-4.0	0.80
MonoBlue	MonoRedv2	0.440	4.4-5.6	0.88

Showing 1 to 10 of 21 entries


Finally, we can instead average the first player’s win probability over both ways of seating the two decks, which shows how much a matchup depends on who goes first:


P1	P2	P1 win probability	matchup	fairness
MonoBlackv2	MonoBlackv2	0.452	4.5-5.5	0.90
MonoBlackv2	MonoBlue	0.509	5.1-4.9	0.98
MonoBlackv2	MonoGreenv2	0.484	4.8-5.2	0.97
MonoBlackv2	MonoPurplev2	0.535	5.3-4.7	0.93
MonoBlackv2	MonoRedv2	0.507	5.1-4.9	0.99
MonoBlackv2	MonoWhitev2	0.543	5.4-4.6	0.91
MonoBlue	MonoBlue	0.504	5.0-5.0	0.99
MonoBlue	MonoGreenv2	0.726	7.3-2.7	0.55
MonoBlue	MonoPurplev2	0.449	4.5-5.5	0.90
MonoBlue	MonoRedv2	0.557	5.6-4.4	0.89

Showing 1 to 10 of 21 entries


Model variances

Each type of component in the model has its own effect variance, and these variances are inferred in the same model simulation. The plot below shows the variance for each component type, scaled by how many such components go into a matchup, i.e. two player skill components, one starter vs. starter component, six starter vs. spec / spec vs. starter components, and nine spec vs. spec components.

On average, total player skill effects on match outcome are about 3.03 times as variable as total deck effects. This is a rough measure of how important the players are to a match, compared to the decks.
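
One plausible reading of this scaling is sketched below in Python. It assumes that the components are independent, so that their variances add, and that the comparison is a ratio of total variances. The page does not say whether the 3.03 figure is a ratio of variances or of standard deviations, nor how it is averaged. The per-component variances in the example are placeholders, not model output, and all names are made up for illustration.

```python
# How many components of each type go into one matchup (from the text above).
COUNTS = {
    "player": 2,
    "starter_vs_starter": 1,
    "starter_vs_spec": 6,  # also covers spec vs. starter
    "spec_vs_spec": 9,
}

def scaled_variances(variances):
    """Scale each component type's variance by its count in a matchup."""
    return {name: COUNTS[name] * var for name, var in variances.items()}

def player_to_deck_ratio(variances):
    """Total player variance divided by total deck variance (assumed reading)."""
    scaled = scaled_variances(variances)
    deck_total = sum(v for name, v in scaled.items() if name != "player")
    return scaled["player"] / deck_total

# PLACEHOLDER variances, chosen only to make the arithmetic easy to follow.
example = {
    "player": 1.0,
    "starter_vs_starter": 0.05,
    "starter_vs_spec": 0.05,
    "spec_vs_spec": 0.05,
}
print(player_to_deck_ratio(example))
```

The three deck counts sum to 16, matching the 16 component pairs in each match.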

Codex model

Mark Webster

Page last updated 2022-09-25
Data last updated 2022-09-14
