Synchronous Development in Open-Source Projects: A Higher-Level Perspective

Complete Material

Downloads

Website Content

Research Questions and Hypotheses
Main Hypotheses
- Hypothesis H1 (4 tables, 3 figures)
- Hypothesis H2 (3×4×8 figures)
- Hypothesis H3 (3×2 tables)
Sub-Hypotheses
- Hypothesis H1.1 (2×3 tables)
- Hypothesis H1.2 (3 tables)
- Hypothesis H1.3 (3 tables)
- Hypothesis H1.4 (3 tables)
- Hypothesis H3.1 (3×3 tables)

Research Questions and Hypotheses

Research Questions

RQ1: Which abstraction level of the source code captures the collaboration of developers best: files or features?
That is, which of the two abstraction levels of the source code leads to identifying a stronger statistical dependence between technical activities of developer pairs?
RQ2: Which abstraction level of the mailing list captures the coordination of developers best: message-based communication or conversation-based communication?
That is, which of the two abstraction levels of the mailing list leads to identifying a stronger statistical dependence between technical activities and social activities on the mailing list?

Hypotheses

Hypotheses Related to C-Bursts

H1

Feature-based collaboration captures developer collaboration more accurately than file-based collaboration.

H1.1

The number of empirical C-bursts is higher than the number of simulated C-bursts.

H1.2

The synchronicity degree of empirical C-bursts is higher than the one of simulated C-bursts.

H1.3

The code growth ∆L is higher in synchronous commits than in non-synchronous commits.

H1.4

The implementation effort ∆W is lower in synchronous commits than in non-synchronous commits.
To evaluate H1, we evaluate the following four sub-hypotheses:

H1+H1.1

The number of empirical feature-based C-bursts is higher than the number of empirical file-based C-bursts.

H1+H1.2

The synchronicity degree of empirical feature-based C-bursts is higher than the synchronicity degree of empirical file-based C-bursts.

H1+H1.3

The code growth ∆L in synchronous commits is higher on features than on files.

H1+H1.4

The implementation effort ∆W in synchronous commits is lower on features than on files.
Hypotheses Related to E-Bursts

H2

Conversation-based communication captures developer coordination more accurately than message-based communication.

H2.1

The relation between the number of C-bursts and the number of E-bursts is described by a linear relationship.
To evaluate H2, we evaluate the following sub-hypothesis:

H2+H2.1

The linear relation between the number of C-bursts and the number of E-bursts has a higher goodness of fit for conversation-based E-bursts than for message-based E-bursts.
Hypotheses Related to C-Bursts and E-Bursts

H3

The temporal correlation between C-bursts and E-bursts is higher for feature-based C-bursts than for file-based C-bursts and higher for conversation-based E-bursts than for message-based E-bursts.

H3.1

C-bursts and E-bursts are temporally correlated, that is, the DTW distances between empirical C-curves and empirical E-curves are smaller than between simulated curves.
To evaluate H3, we evaluate the following sub-hypothesis:

H3+H3.1

C-bursts and E-bursts are temporally correlated, that is, the DTW distances between empirical C-curves and empirical E-curves are smaller than between simulated curves.

Statistical Tests for Main Hypotheses

Content

Hypothesis H1 (4 tables, 3 figures)
Hypothesis H2 (3×2×8 figures)
Hypothesis H3 (3×2 tables)

Hypothesis H1

(Jump to comparison regarding H1.1 | H1.2 | H1.3 | H1.4)

**Hypothesis H1+H1.1.** Comparing abstraction levels regarding **H1.1**. A p-value < 0.05 means that the empirical numbers of C-bursts per developer pair of the feature-based analysis are significantly higher than the corresponding empirical numbers of C-bursts per developer pair of the file-based analysis. (One-tailed Mann-Whitney U test with FDR correction, alternative hypothesis: feature > file)
feature > file
ξ	U	p-value	Cliff's Delta
BusyBox
1	135485888	0.2028024	0.0006
5	135266090	0.0095987932	0.0023
10	135194592	0.0036622953	0.0028
15	135199201	0.006504682	0.0028
OpenSSL
1	35673033	7.246036e-14	0.0126
5	35367796	3.616927e-18	0.0211
10	35122324	3.910268e-24	0.0279
15	34969476	1.707343e-27	0.0321
QEMU
1	25891734214	1.000092e-48	0.0028
5	25746439556	2.888522e-183	0.0084
10	25612361104	0	0.0136
15	25515286716	0	0.0173

Synchronicity-degree comparison for BusyBox

Synchronicity-degree comparison for OpenSSL

Synchronicity-degree comparison for QEMU

**Hypothesis H1+H1.2.** Comparing abstraction levels regarding **H1.2**. A p-value < 0.05 means that the empirical synchronicity degrees of the feature-based analysis are significantly higher than the empirical synchronicity degrees of the file-based analysis. (One-tailed Mann-Whitney U test with FDR correction, alternative hypothesis: feature > file)
feature > file
ξ	U	p-value	Cliff's Delta
BusyBox
1	15544.5	0.6187063	-0.0200
5	45281.5	0.3691557	0.0164
10	57475	0.4313845	0.0080
15	62482	0.5603974	0.0488
OpenSSL
1	47968.5	0.0001996878	0.1622
5	202911	9.488344e-12	0.2073
10	298943.5	2.572438e-11	0.1837
15	332521	1.326538e-12	0.1902
QEMU
1	681839.5	3.757765e-40	0.2984
5	3949898	4.786402e-210	0.4257
10	8290824	0	0.4864
15	12049580	0	0.5135

**Hypothesis H1+H1.3.** Comparing abstraction levels regarding **H1.3**. A p-value < 0.05 means that the code growth ΔL in the synchronous commits of the feature-based analysis is significantly higher than the code growth ΔL in the synchronous commits of the file-based analysis. (One-tailed Mann-Whitney U test with FDR correction, alternative hypothesis: feature > file)
feature > file
ξ	U	p-value	Cliff's Delta
BusyBox
1	114081	0.1967877	0.0608
5	689731.5	0.02884461	0.0589
10	1350738	0.0001153576	0.0836
15	1890074	1.298514e-05	0.0871
OpenSSL
1	310536	0.9539756	-0.0029
5	1658454	0.9885068	-0.0112
10	3070373	0.989393	-0.0026
15	4241030	0.9916779	0.0053
QEMU
1	7273482	1	-0.0366
5	29462292	1	-0.0089
10	48855197	1	0.0036
15	61288970	0.3023295	0.0110

**Hypothesis H1+H1.4.** Comparing abstraction levels regarding **H1.4**. A p-value < 0.05 means that the code effort ΔW in the synchronous commits of the feature-based analysis is significantly lower than the code effort ΔW in the synchronous commits of the file-based analysis. (One-tailed Mann-Whitney U test with FDR correction, alternative hypothesis: feature < file)
feature < file
ξ	U	p-value	Cliff's Delta
BusyBox
1	123691	0.903862	-0.0183
5	714858.5	0.4960634	0.0246
10	1393609	0.0154136	0.0545
15	1969254	0.01952964	0.0488
OpenSSL
1	320524	0.98942	-0.0352
5	1699802	0.9999997	-0.0364
10	3186181	1	-0.0405
15	4384574	1	-0.0284
QEMU
1	7013639	1	0.0004
5	28321505	1	0.0301
10	46651924	1.161575e-07	0.0485
15	58353021	1	0.0584

Hypothesis 2

(Jump to data regarding BusyBox | OpenSSL | QEMU)

BusyBox

Message-based • lower-bound approach

Conversation-based • lower-bound approach

Message-based • upper-bound approach

Conversation-based • upper-bound approach

OpenSSL

Message-based • lower-bound approach

Conversation-based • lower-bound approach

Message-based • upper-bound approach

Conversation-based • upper-bound approach

QEMU

Message-based • lower-bound approach

Conversation-based • lower-bound approach

Message-based • upper-bound approach

Conversation-based • upper-bound approach

Hypothesis H3

(Jump to results of H3 using smoothing parameter and band-window size 1ξ | 2ξ | 3ξ)

Smoothing parameter and band-window size: 1ξ

(Jump to comparison regarding artifact abstraction | mail abstraction)

**Hypothesis H3+H3.1 (Smoothing parameter and band-window size: 1ξ).** Testing **H3.1** for different abstraction levels. A p-value < 0.05 means that the empirical DTW distances of the feature-based analysis are significantly smaller than the corresponding empirical DTW distances of the file-based analysis. (One-tailed Mann-Whitney U test with FDR correction, alternative hypothesis: feature < file)
feature < file
ξ	mailing-list abstraction	U	p-value	Cliff's Delta
BusyBox • lower-bound approach
1	message	29.5	0.1907301	-0.4048
5	message	139.5	0.1811315	-0.3413
10	message	266.5	0.1820859	-0.2690
15	message	346	0.339109	-0.1829
1	conversation	17	0.7147429	0.1905
5	conversation	141	0.2041459	-0.3558
10	conversation	209	0.3007771	-0.2667
15	conversation	287.5	0.5288958	-0.1735
OpenSSL • lower-bound approach
1	message	--	--	--
5	message	0	0.9772499	1.0000
10	message	5	0.8067619	0.1667
15	message	8	0.5	-0.0667
1	conversation	--	--	--
5	conversation	--	--	--
10	conversation	--	--	--
15	conversation	--	--	--
QEMU • lower-bound approach
1	message	114530.5	0.03992912	-0.0697
5	message	1036377.0	3.509142e-12	-0.1572
10	message	2513288	2.837048e-07	-0.0901
15	message	4050886	4.012332e-08	-0.0856
1	conversation	21600.0	0.155811	-0.0634
5	conversation	90444.5	0.05896802	-0.0681
10	conversation	189474.0	0.1613079	-0.0352
15	conversation	273238.5	0.1803632	-0.0296
BusyBox • upper-bound approach
1	message	1762.5	0.2869297	-0.0633
5	message	3149.5	0.912163	0.1229
10	message	4326.5	0.9658731	0.1517
15	message	5293.5	0.97921	0.1608
1	conversation	1160.5	0.6973924	0.0622
5	conversation	2415	0.9615439	0.1692
10	conversation	2930.5	0.9852391	0.1967
15	conversation	3547.5	0.9883342	0.1956
OpenSSL • upper-bound approach
1	message	2372.5	0.3346254	-0.0429
5	message	9334	0.9078902	0.0911
10	message	14814.5	0.8931269	0.0762
15	message	20733	0.9114269	0.0761
1	conversation	966.5	0.3451975	-0.0505
5	conversation	2662.5	0.7336254	0.0589
10	conversation	3956.5	0.8085630	0.0744
15	conversation	5465.5	0.6788902	0.0368
QEMU • upper-bound approach
1	message	360931.0	0.46343033	-0.0027
5	message	2117556	3.646642e-01	-0.0063
10	message	4843092	9.326571e-01	0.0221
15	message	7656084	9.376598e-01	0.0202
1	conversation	196781.0	0.26515260	-0.0215
5	conversation	883089.0	0.935644957	0.0348
10	conversation	1731948	0.99999715	0.0862
15	conversation	2424468	1.000000	0.1040

**Hypothesis H3+H3.1 (Smoothing parameter and band-window size: 1ξ).** Testing **H3.1** for different abstraction levels. A p-value < 0.05 means that the empirical DTW distances of the conversation-based analysis are significantly smaller than the corresponding empirical DTW distances of the message-based analysis. (One-tailed Mann-Whitney U test with FDR correction, alternative hypothesis: conversation < message)
conversation < message
ξ	artifact abstraction	U	p-value	Cliff's Delta
BusyBox • lower-bound approach
1	file	102.5	0.5910117	-0.0459
5	file	334	0.4744592	0.0118
10	file	527	0.5131239	-0.0038
15	file	696	0.5602421	-0.0198
1	feature	6.5	0.8658571	-0.4444
5	feature	31	0.4790301	0.0312
10	feature	72	0.655512	-0.0909
15	feature	108.5	0.5693267	-0.0333
OpenSSL • lower-bound approach
1	file	--	--	--
5	file	1.5	0.9213504	-0.5000
10	file	2	0.6381632	0.0000
15	file	3	0.7209077	-0.2000
1	feature	--	--	--
5	feature	--	--	--
10	feature	--	--	--
15	feature	--	--	--
QEMU • lower-bound approach
1	file	168075	1	-0.2738
5	file	916742.5	1	-0.3503
10	file	2064022	1	-0.3727
15	file	3145634	1	-0.3829
1	feature	21100	0.9999935	-0.2801
5	feature	162799.5	1	-0.4574
10	feature	407885	1	-0.4534
15	feature	637096	1	-0.4635
BusyBox • upper-bound approach
1	file	3100.5	0.3837009	0.0273
5	file	4784	0.01238102	0.1772
10	file	6646.5	0.02616465	0.1426
15	file	8053.5	0.01127874	0.1590
1	feature	722	0.8140288	-0.1220
5	feature	1496	0.05805552	0.1668
10	feature	2116	0.115457	0.1183
15	feature	2556	0.1009288	0.1201
OpenSSL • upper-bound approach
1	file	1416	0.1373853	0.1205
5	file	5844	0.0545929	0.1255
10	file	8401	0.002500195	0.1964
15	file	11141	0.0007705685	0.2065
1	feature	1145	0.1539802	0.1192
5	feature	3498	0.01217159	0.1953
10	feature	5140	0.002939712	0.2161
15	feature	6770	0.0002303006	0.2535
QEMU • upper-bound approach
1	file	525906	0.0001413022	0.0907
5	file	1532476	5.566383e-120	0.4019
10	file	3059821	3.80131e-199	0.4328
15	file	4468298	1.332894e-250	0.4415
1	feature	107555	0.002878516	0.1029
5	feature	494158	1.329499e-47	0.3422
10	feature	1127558	1.01005e-74	0.3520
15	feature	1749509	5.353178e-84	0.3380

Smoothing parameter and band-window size: 2ξ

(Jump to comparison regarding artifact abstraction | mail abstraction)

**Hypothesis H3+H3.1 (Smoothing parameter and band-window size: 2ξ).** Testing **H3.1** for different abstraction levels. A p-value < 0.05 means that the empirical DTW distances of the feature-based analysis are significantly smaller than the corresponding empirical DTW distances of the file-based analysis. (One-tailed Mann-Whitney U test with FDR correction, alternative hypothesis: feature < file)
feature < file
ξ	mailing-list abstraction	U	p-value	Cliff's Delta
BusyBox • lower-bound approach
1	message	31.5	0.2344606	-0.50000000
5	message	143	0.123247	-0.37500000
10	message	262.5	0.15337	-0.25000000
15	message	339.0	0.2807124	-0.15897436
1	conversation	31	0.2586764	-0.47619048
5	conversation	141	0.1599226	-0.35576923
10	conversation	218.5	0.1701726	-0.32424242
15	conversation	302.0	0.3166841	-0.23265306
OpenSSL • lower-bound approach
1	message	--	--	--
5	message	0	0.9772499	1
10	message	4	0.8116204	0.3333333
15	message	7	0.6172028	0.06666667
1	conversation	--	--	--
5	conversation	--	--	--
10	conversation	--	--	--
15	conversation	--	--	--
QEMU • lower-bound approach
1	message	126005.5	4.470506e-06	-0.1768681
5	message	980110.0	1.921303e-05	-0.09437354
10	message	2418520	0.003257534	-0.04902169
15	message	3864428	0.01271414	-0.03565929
1	conversation	21549.5	0.1656193	-0.06087235
5	conversation	89128.5	0.1138065	-0.05253307
10	conversation	185452.5	0.3550445	-0.01323277
15	conversation	266484.0	0.4489555	-0.00415818
BusyBox • upper-bound approach
1	message	1674	0.4656921	-0.009954751
5	message	3125.0	0.9235347	0.12976887
10	message	4318.0	0.9673609	0.15333333
15	message	5306.5	0.9778691	0.15876665
1	conversation	1209	0.576684	0.023030303
5	conversation	2402.5	0.965145	0.17354661
10	conversation	2915	0.9869064	0.200932018
15	conversation	3579.5	0.9855139	0.188321995
OpenSSL • upper-bound approach
1	message	2277	0.4973648	-0.0008791209
5	message	9292	0.9173539	0.09514071
10	message	14704	0.9123975	0.08312028
15	message	20543.5	0.9331116	0.08451426
1	conversation	920.0	0.5017270	2.902951e-18
5	conversation	2663.5	0.7323969	0.058501237
10	conversation	3945.5	0.8166709	0.076968066
15	conversation	5267.0	0.8172146	0.07181249
QEMU • upper-bound approach
1	message	370583.0	0.15409065	-0.02946871
5	message	2103508	5.071624e-01	0.0003291037
10	message	4815039	9.699130e-01	0.02775097
15	message	7600899	9.808686e-01	0.02727014
1	conversation	197184.5	0.245545342	-0.02362252
5	conversation	876425	0.966939181	0.04203912
10	conversation	1709122	0.9999999	0.09822544
15	conversation	2384348	1.0000000	0.118875992

**Hypothesis H3+H3.1 (Smoothing parameter and band-window size: 2ξ).** Testing **H3.1** for different abstraction levels. A p-value < 0.05 means that the empirical DTW distances of the conversation-based analysis are significantly smaller than the corresponding empirical DTW distances of the message-based analysis. (One-tailed Mann-Whitney U test with FDR correction, alternative hypothesis: conversation < message)
conversation < message
ξ	artifact abstraction	U	p-value	Cliff's Delta
BusyBox • lower-bound approach
1	file	94.5	0.4451449	0.0357
5	file	335	0.4817489	0.0089
10	file	549.5	0.6289225	-0.0467
15	file	715.5	0.6415967	-0.0484
1	feature	3.5	0.412389	0.2222
5	feature	32	0.5209699	0.0000
10	feature	61	0.3908018	0.0758
15	feature	109.5	0.5864227	-0.0429
OpenSSL • lower-bound approach
1	file	--	--	--
5	file	1.5	0.9213504	-0.5000
10	file	2	0.6381632	0.0000
15	file	3	0.7209077	-0.2000
1	feature	--	--	--
5	feature	--	--	--
10	feature	--	--	--
15	feature	--	--	--
QEMU • lower-bound approach
1	file	176881	1	-0.3406
5	file	938406.5	1	-0.3822
10	file	2074267	1	-0.3795
15	file	3166391	1	-0.3920
1	feature	23460.5	1	-0.4233
5	feature	160108.5	1	-0.4333
10	feature	408942	1	-0.4572
15	feature	630181	1	-0.4476
BusyBox • upper-bound approach
1	file	2846.5	0.1221561	0.1070
5	file	4960.5	0.03142918	0.1468
10	file	6873.5	0.06156174	0.1133
15	file	8514	0.05583297	0.1109
1	feature	582	0.2452866	0.0956
5	feature	1575	0.1238102	0.1228
10	feature	2227	0.2330175	0.0721
15	feature	2712	0.2404133	0.0664
OpenSSL • upper-bound approach
1	file	1313.5	0.04740422	0.1842
5	file	6017	0.1017992	0.0997
10	file	8839	0.01361712	0.1545
15	file	11710	0.00546155	0.1660
1	feature	1060	0.05701201	0.1846
5	feature	3575	0.02031265	0.1776
10	feature	5414	0.01315417	0.1743
15	feature	7339	0.004197322	0.1908
QEMU • upper-bound approach
1	file	399017	1.153492e-35	0.3101
5	file	1573106	7.068196e-111	0.3861
10	file	3239574	5.150017e-170	0.3995
15	file	4809009	4.894826e-205	0.3989
1	feature	85737.5	1.049393e-14	0.2849
5	feature	506895.5	3.343159e-43	0.3253
10	feature	1205634	2.149876e-57	0.3071
15	feature	1894614	1.435514e-59	0.2831

Smoothing parameter and band-window size: 3ξ

(Jump to comparison regarding artifact abstraction | mail abstraction)

**Hypothesis H3+H3.1 (Smoothing parameter and band-window size: 3ξ).** Testing **H3.1** for different abstraction levels. A p-value < 0.05 means that the empirical DTW distances of the feature-based analysis are significantly smaller than the corresponding empirical DTW distances of the file-based analysis. (One-tailed Mann-Whitney U test with FDR correction, alternative hypothesis: feature < file)
feature < file
ξ	mailing-list abstraction	U	p-value	Cliff's Delta
BusyBox • lower-bound approach
1	message	30.5	0.2344606	-0.4524
5	message	149	0.1023692	-0.4327
10	message	266	0.1317595	-0.2667
15	message	342.5	0.2543151	-0.1709
1	conversation	31.5	0.1890857	-0.5000
5	conversation	143	0.1469495	-0.3750
10	conversation	227	0.1054986	-0.3758
15	conversation	277.0	0.4089362	-0.1306
OpenSSL • lower-bound approach
1	message	--	--	--
5	message	0	0.9772499	1.0000
10	message	4	0.8116204	0.3333
15	message	7	0.6172028	0.0667
1	conversation	--	--	--
5	conversation	--	--	--
10	conversation	--	--	--
15	conversation	--	--	--
QEMU • lower-bound approach
1	message	127734.5	6.275401e-07	-0.1930
5	message	964266.0	0.0004111805	-0.0767
10	message	2394268	0.01630389	-0.0385
15	message	3899014	0.002433373	-0.0449
1	conversation	21481.5	0.1792627	-0.0575
5	conversation	87867.5	0.1936604	-0.0376
10	conversation	183296.0	0.4837749	-0.0015
15	conversation	265513.5	0.4938516	-0.0005
BusyBox • upper-bound approach
1	message	1662.0	0.4914131	-0.0027
5	message	3099	0.9343399	0.1370
10	message	4238	0.9789136	0.1690
15	message	5314.5	0.977008	0.1575
1	conversation	1199.5	0.6013256	0.0307
5	conversation	2386.5	0.9693453	0.1791
10	conversation	2965.0	0.980861	0.1872
15	conversation	3557.5	0.9875089	0.1933
OpenSSL • upper-bound approach
1	message	2271	0.5079052	0.0018
5	message	9205.0	0.9346033	0.1036
10	message	14523	0.9381803	0.0944
15	message	20332.5	0.9521578	0.0939
1	conversation	908.0	0.5430922	0.0130
5	conversation	2649.5	0.7493272	0.0634
10	conversation	3858.5	0.8730878	0.0973
15	conversation	5278.0	0.8106805	0.0699
QEMU • upper-bound approach
1	message	373791.5	0.092207855	0.0922
5	message	2096468	5.793862e-01	0.0037
10	message	4801986	9.802064e-01	0.0304
15	message	7571306	9.908571e-01	0.0311
1	conversation	196211.5	0.294141462	-0.0186
5	conversation	872148.5	0.979420225	0.0467
10	conversation	1696236	0.99999998	0.1050
15	conversation	2362834	1.000000	0.1268

**Hypothesis H3+H3.1 (Smoothing parameter and band-window size: 3ξ).** Testing **H3.1** for different abstraction levels. A p-value < 0.05 means that the empirical DTW distances of the conversation-based analysis are significantly smaller than the corresponding empirical DTW distances of the message-based analysis. (One-tailed Mann-Whitney U test with FDR correction, alternative hypothesis: conversation < message)
conversation < message
ξ	artifact abstraction	U	p-value	Cliff's Delta
BusyBox • lower-bound approach
1	file	99.5	0.5366444	-0.0153
5	file	335.5	0.4853985	0.0074
10	file	557	0.6655742	-0.0610
15	file	710.5	0.6211794	-0.0410
1	feature	2.5	0.2532776	0.4444
5	feature	35	0.6436064	-0.0938
10	feature	70.5	0.6210005	-0.0682
15	feature	127	0.8370049	-0.2095
OpenSSL • lower-bound approach
1	file	--	--	--
5	file	1	0.7298543	0.0000
10	file	2	0.6381632	0.0000
15	file	4	0.8792167	-0.6000
1	feature	--	--	--
5	feature	--	--	--
10	feature	--	--	--
15	feature	--	--	--
QEMU • lower-bound approach
1	file	172707.5	1	-0.3090
5	file	944118.5	1	-0.3907
10	file	2090894	1	-0.3905
15	file	3184026	1	-0.3998
1	feature	22989	1	-0.3947
5	feature	162067.5	1	-0.4508
10	feature	411195.5	1	-0.4652
15	feature	632020.5	1	-0.4518
BusyBox • upper-bound approach
1	file	2801.5	0.09372704	0.1211
5	file	4973.5	0.03348719	0.1446
10	file	6990.5	0.09071909	0.0982
15	file	8710	0.09731117	0.0904
1	feature	563.5	0.1844637	0.1243
5	feature	1579	0.1281613	0.1206
10	feature	2201	0.2007864	0.0829
15	feature	2756	0.2932769	0.0513
OpenSSL • upper-bound approach
1	file	1315	0.04824589	0.1832
5	file	6074	0.1225216	0.0911
10	file	8966	0.02095059	0.1424
15	file	11868	0.008833537	0.1547
1	feature	1078	0.07192717	0.1708
5	feature	3623.5	0.02751554	0.1664
10	feature	5535	0.02349773	0.1559
15	feature	7417	0.005919076	0.1822
QEMU • upper-bound approach
1	file	364234.5	5.724665e-50	0.3703
5	file	1574941	1.785894e-110	0.3854
10	file	3268462	1.501843e-165	0.3941
15	file	4902068	2.157015e-193	0.3873
1	feature	80305	3.989165e-19	0.3302
5	feature	508927.5	1.605788e-42	0.3226
10	feature	1227856	6.853704e-53	0.2943
15	feature	1944635	3.833429e-52	0.2642

Statistical Tests for Sub-Hypotheses

Hypothesis H1.1 (2×3 tables)
Hypothesis H1.2 (3 tables)
Hypothesis H1.3 (3 tables)
Hypothesis H1.4 (3 tables)
Hypothesis H3.1 (3×3 tables)

Hypothesis H1.1

(Jump to data for BusyBox | OpenSSL | QEMU,
BusyBox (only pairs having a C-burst) | OpenSSL (only pairs having a C-burst) | QEMU (only pairs having a C-burst))

**Hypothesis H1.1.** Paired, one-tailed Wilcoxon signed rank test for comparing empirical and simulated number of C-bursts per developer pair for BusyBox. A p-value < 0.05 means that the number of empirical C-bursts per developer pair is significantly higher than the corresponding number of simulated C-bursts per developer pair.
BusyBox
ξ	mean ± std. dev. of the empirical number of C-bursts per developer pair	mean ± std. dev. of the simulated number of C-bursts per developer pair	V	p-value	r
feature
1	0.01 ± 0.20	0.00 ± 0.06	1321	8.597e-10	-0.0594
5	0.02 ± 0.38	0.01 ± 0.18	2927	3.166e-10	-0.0609
10	0.02 ± 0.39	0.01 ± 0.24	4511	6.111e-08	-0.0521
15	0.02 ± 0.40	0.01 ± 0.28	5780.5	3.24e-07	-0.0491
file
1	0.01 ± 0.34	0.00 ± 0.13	6213	< 2.2e-16	-0.0531
5	0.02 ± 0.48	0.01 ± 0.33	9034	2.656e-16	-0.0500
10	0.02 ± 0.47	0.01 ± 0.36	11769	1.728e-13	-0.0449
15	0.02 ± 0.40	0.01 ± 0.35	13247	4.773e-10	-0.0377

**Hypothesis H1.1.** Paired, one-tailed Wilcoxon signed rank test for comparing empirical and simulated number of C-bursts per developer pair for OpenSSL. A p-value < 0.05 means that the number of empirical C-bursts per developer pair is significantly higher than the corresponding number of simulated C-bursts per developer pair.
OpenSSL
ξ	mean ± std. dev. of the empirical number of C-bursts per developer pair	mean ± std. dev. of the simulated number of C-bursts per developer pair	V	p-value	r
feature
1	0.06 ± 0.72	0.02 ± 0.30	7216	1.353e-07	-0.0716
5	0.13 ± 1.33	0.07 ± 0.88	25084	2.183e-07	-0.0704
10	0.15 ± 1.54	0.11 ± 1.18	45655	0.0002217	-0.0489
15	0.16 ± 1.52	0.14 ± 1.29	61138	0.001974	-0.0401
file
1	0.03 ± 0.57	0.01 ± 0.20	4983	2.614e-16	-0.0685
5	0.06 ± 0.96	0.03 ± 0.58	21636	< 2.2e-16	-0.0903
10	0.07 ± 1.05	0.05 ± 0.78	24502	4.008e-15	-0.0656
15	0.07 ± 1.03	0.06 ± 0.86	26400	3.943e-11	-0.0549

**Hypothesis H1.1.** Paired, one-tailed Wilcoxon signed rank test for comparing empirical and simulated number of C-bursts per developer pair for QEMU. A p-value < 0.05 means that the number of empirical C-bursts per developer pair is significantly higher than the corresponding number of simulated C-bursts per developer pair.
QEMU
ξ	mean ± std. dev. of the empirical number of C-bursts per developer pair	mean ± std. dev. of the simulated number of C-bursts per developer pair	V	p-value	r
feature
1	0.01 ± 0.16	0.00 ± 0.14	257870	< 2.2e-16	-0.0395
5	0.02 ± 0.30	0.02 ± 0.30	1908400	< 2.2e-16	-0.0380
10	0.04 ± 0.34	0.03 ± 0.35	4973200	< 2.2e-16	-0.0360
15	0.04 ± 0.37	0.04 ± 0.38	8208900	< 2.2e-16	-0.0324
file
1	0.00 ± 0.12	0.00 ± 0.08	706200	2.2e-16	-0.0405
5	0.01 ± 0.24	0.01 ± 0.21	2948700	< 2.2e-16	-0.0448
10	0.02 ± 0.31	0.01 ± 0.28	5925700	< 2.2e-16	-0.0470
15	0.02 ± 0.34	0.02 ± 0.31	8491000	< 2.2e-16	-0.0475

**Hypothesis H1.1 (only pairs having, at least, one C-burst).** Paired, one-tailed Wilcoxon signed rank test for comparing empirical and simulated number of C-bursts per developer pair for BusyBox. A p-value < 0.05 means that the number of empirical C-bursts per developer pair is significantly higher than the corresponding number of simulated C-bursts per developer pair.
BusyBox
ξ	mean ± std. dev. of the empirical number of C-bursts per developer pair	mean ± std. dev. of the simulated number of C-bursts per developer pair	V	p-value	r
feature
1	1.96 ± 2.05	0.26 ± 0.80	1225	1.058e-10	-0.899
5	2.48 ± 3.59	0.74 ± 1.81	2927	1.094e-12	-0.785
10	2.34 ± 3.30	0.98 ± 2.27	3572	1.335e-12	-0.714
15	2.30 ± 3.20	1.05 ± 2.48	4347.5	6.879e-14	-0.722
file
1	2.80 ± 4.51	0.61 ± 1.92	5974	< 2.2e-16	-0.857
5	3.21 ± 5.65	1.54 ± 4.09	8276	< 2.2e-16	-0.758
10	2.99 ± 4.95	1.71 ± 4.09	9845.5	< 2.2e-16	-0.672
15	2.63 ± 3.89	1.64 ± 3.64	9721.5	< 2.2e-16	-0.599

**Hypothesis H1.1 (only pairs having, at least, one C-burst).** Paired, one-tailed Wilcoxon signed rank test for comparing empirical and simulated number of C-bursts per developer pair for OpenSSL. A p-value < 0.05 means that the number of empirical C-bursts per developer pair is significantly higher than the corresponding number of simulated C-bursts per developer pair.
OpenSSL
ξ	mean ± std. dev. of the empirical number of C-bursts per developer pair	mean ± std. dev. of the simulated number of C-bursts per developer pair	V	p-value	r
feature
1	2.95 ± 4.12	0.21 ± 0.78	5269	< 2.2e-16	-0.867
5	3.27 ± 6.01	0.58 ± 1.94	16974	< 2.2e-16	-0.812
10	3.05 ± 6.22	0.76 ± 2.59	26853	< 2.2e-16	-0.780
15	2.78 ± 5.71	0.83 ± 2.82	35681	< 2.2e-16	-0.766
file
1	3.52 ± 5.60	0.93 ± 2.04	4465	< 2.2e-16	-0.841
5	3.30 ± 6.58	1.62 ± 4.13	19662	< 2.2e-16	-0.767
10	3.02 ± 6.38	1.93 ± 4.86	19694	< 2.2e-16	-0.614
15	2.77 ± 5.86	1.98 ± 4.97	20038	< 2.2e-16	-0.522

**Hypothesis H1.1 (only pairs having, at least, one C-burst).** Paired, one-tailed Wilcoxon signed rank test for comparing empirical and simulated number of C-bursts per developer pair for QEMU. A p-value < 0.05 means that the number of empirical C-bursts per developer pair is significantly higher than the corresponding number of simulated C-bursts per developer pair.
QEMU
ξ	mean ± std. dev. of the empirical number of C-bursts per developer pair	mean ± std. dev. of the simulated number of C-bursts per developer pair	V	p-value	r
feature
1	1.46 ± 1.59	0.27 ± 1.49	204900	< 2.2e-16	-0.888
5	1.58 ± 1.88	0.25 ± 1.68	1302800	< 2.2e-16	-0.857
10	1.56 ± 1.67	0.19 ± 1.24	3165600	< 2.2e-16	-0.850
15	1.53 ± 1.60	0.18 ± 1.05	4968700	< 2.2e-16	-0.852
file
1	1.58 ± 1.51	0.47 ± 1.35	660460	< 2.2e-16	-0.822
5	1.86 ± 2.35	0.99 ± 2.49	2533500	< 2.2e-16	-0.705
10	1.90 ± 2.56	1.13 ± 2.65	4927400	< 2.2e-16	-0.659
15	1.89 ± 2.56	1.17 ± 2.61	6903100	< 2.2e-16	-0.642

Hypothesis H1.2

(Jump to data for BusyBox | OpenSSL | QEMU)

**Hypothesis H1.2.** One-tailed Mann-Whitney U test for comparing empirical and simulated synchronicity degrees for BusyBox. A p-value < 0.05 means that the empirical synchronicity degrees are significantly higher than the simulated synchronicity degrees.
BusyBox
ξ	mean ± std. dev. of empirical deg_sync	mean ± std. dev. of simulated deg_sync	U	p-value	Cliff's Delta
feature
1	0.60 ± 0.32	0.31 ± 0.26	199030	< 2.2e-16	0.5128
5	0.47 ± 0.33	0.26 ± 0.24	1267300	< 2.2e-16	0.3854
10	0.42 ± 0.31	0.24 ± 0.23	2182500	< 2.2e-16	0.3721
15	0.39 ± 0.31	0.23 ± 0.23	2776200	< 2.2e-16	0.3387
file
1	0.61 ± 0.35	0.32 ± 0.30	2416800	< 2.2e-16	0.4502
5	0.47 ± 0.35	0.25 ± 0.25	9497500	< 2.2e-16	0.3873
10	0.43 ± 0.34	0.23 ± 0.24	13510000	< 2.2e-16	0.3570
15	0.41 ± 0.34	0.22 ± 0.24	14568000	< 2.2e-16	0.3265

**Hypothesis H1.2.** One-tailed Mann-Whitney U test for comparing empirical and simulated synchronicity degrees for OpenSSL. A p-value < 0.05 means that the empirical synchronicity degrees are significantly higher than the simulated synchronicity degrees.
OpenSSL
ξ	mean ± std. dev. of empirical deg_sync	mean ± std. dev. of simulated deg_sync	U	p-value	Cliff's Delta
feature
1	0.51 ± 0.33	0.39 ± 0.29	2635200	3.578e-09	0.1927
5	0.43 ± 0.29	0.35 ± 0.27	17328000	2.19e-14	0.1729
10	0.38 ± 0.27	0.32 ± 0.25	29429000	3.81e-12	0.1419
15	0.37 ± 0.27	0.31 ± 0.25	36351000	1.819e-13	0.1467
file
1	0.42 ± 0.34	0.30 ± 0.28	3616500	2.146e-12	0.2095
5	0.34 ± 0.30	0.25 ± 0.24	24001000	2.2e-16	0.1797
10	0.32 ± 0.29	0.23 ± 0.23	40996000	< 2.2e-16	0.1829
15	0.30 ± 0.28	0.22 ± 0.22	51711000	< 2.2e-16	0.1833

**Hypothesis H1.2.** One-tailed Mann-Whitney U test for comparing empirical and simulated synchronicity degrees for QEMU. A p-value < 0.05 means that the empirical synchronicity degrees are significantly higher than the simulated synchronicity degrees.
QEMU
ξ	mean ± std. dev. of empirical deg_sync	mean ± std. dev. of simulated deg_sync	U	p-value	Cliff's Delta
feature
1	0.64 ± 0.31	0.60 ± 0.31	47697000	2.255e-05	0.0763
5	0.61 ± 0.31	0.58 ± 0.31	418200000	9.187e-06	0.0481
10	0.60 ± 0.31	0.57 ± 0.31	981100000	1.061e-09	0.0542
15	0.59 ± 0.31	0.57 ± 0.31	1509200000	2.926e-09	0.0473
file
1	0.47 ± 0.35	0.34 ± 0.31	169020000	< 2.2e-16	0.2071
5	0.37 ± 0.33	0.30 ± 0.28	1403200000	< 2.2e-16	0.1232
10	0.33 ± 0.31	0.27 ± 0.27	3296700000	< 2.2e-16	0.1098
15	0.31 ± 0.30	0.25 ± 0.26	5078600000	< 2.2e-16	0.0969

Hypothesis H1.3

(Jump to data for BusyBox | OpenSSL | QEMU)

**Hypothesis H1.3.** One-tailed Mann-Whitney U test for comparing code growth ΔL of synchronous and non-synchronous commits for BusyBox. A p-value < 0.05 means that the code growth ΔL in synchronous commits is significantly higher than the code growth ΔL in non-synchronous commits.
BusyBox
ξ	ΔL (mean ± std. dev.) in synchronous commits	ΔL (mean ± std. dev.) in non-synchronous commits	U	p-value	Cliff's Delta
feature
1	40.71 ± 217.16	72.90 ± 866.91	621380	0.1606	0.0366
5	152.54 ± 1,436.99	57.94 ± 702.23	1515300	0.1113	0.0287
10	143.84 ± 1,297.45	53.26 ± 688.30	2027500	0.01392	0.0452
15	136.44 ± 1,228.08	51.96 ± 692.07	2251200	0.006921	0.0480
file
1	34.28 ± 210.01	47.88 ± 683.84	4371200	0.3396	0.0081
5	47.07 ± 546.76	46.47 ± 679.66	8416800	0.5163	-0.0006
10	45.97 ± 497.10	46.87 ± 710.35	10410000	0.9621	-0.0222
15	54.17 ± 688.27	42.26 ± 633.61	11448000	0.9911	-0.0282

**Hypothesis H1.3.** One-tailed Mann-Whitney U test for comparing code growth ΔL of synchronous and non-synchronous commits for OpenSSL. A p-value < 0.05 means that the code growth ΔL in synchronous commits is significantly higher than the code growth ΔL in non-synchronous commits.
OpenSSL
ξ	ΔL (mean ± std. dev.) in synchronous commits	ΔL (mean ± std. dev.) in non-synchronous commits	U	p-value	Cliff's Delta
feature
1	52.89 ± 331.81	83.37 ± 1,357.07	1148900	0.9497	-0.0395
5	69.66 ± 408.72	83.27 ± 1,525.61	1968200	0.9165	-0.0256
10	65.75 ± 400.89	90.82 ± 1,710.95	2164700	0.859	-0.0190
15	112.25 ± 1,638.92	32.05 ± 129.24	2146900	0.5877	-0.0040
file
1	68.61 ± 578.95	64.72 ± 1,258.09	3095900	0.9043	-0.0263
5	57.87 ± 418.90	68.00 ± 1,388.58	6277300	0.4181	0.0030
10	111.12 ± 1,941.32	37.67 ± 203.15	7286000	0.484	0.0005
15	98.60 ± 1,781.98	38.37 ± 209.83	7679800	0.5056	-0.0002

**Hypothesis H1.3.** One-tailed Mann-Whitney U test for comparing code growth ΔL of synchronous and non-synchronous commits for QEMU. A p-value < 0.05 means that the code growth ΔL in synchronous commits is significantly higher than the code growth ΔL in non-synchronous commits.
QEMU
ξ	ΔL (mean ± std. dev.) in synchronous commits	ΔL (mean ± std. dev.) in non-synchronous commits	U	p-value	Cliff's Delta
feature
1	53.36 ± 310.37	78.06 ± 366.91	6548900	1	-0.0615
5	47.65 ± 269.63	96.74 ± 422.17	7994600	1	-0.1042
10	48.25 ± 262.21	108.91 ± 460.59	7356300	1	-0.1287
15	50.72 ± 267.30	111.97 ± 474.95	6895300	1	-0.1388
file
1	42.65 ± 732.06	39.24 ± 263.36	96963000	1.381e-13	0.0586
5	34.99 ± 515.49	42.77 ± 285.68	152310000	0.0001606	0.0226
10	34.23 ± 448.14	46.10 ± 308.76	158750000	0.2157	0.0048
15	32.98 ± 415.29	51.16 ± 339.91	147620000	0.9134	-0.0086

Hypothesis H1.4

(Jump to data for BusyBox | OpenSSL | QEMU)

**Hypothesis H1.4.** One-tailed Mann-Whitney U test for comparing code effort ΔW of synchronous and non-synchronous commits for BusyBox. A p-value < 0.05 means that the code effort ΔW in synchronous commits is significantly lower than the code effort ΔW in non-synchronous commits.
BusyBox
ξ	ΔW (mean ± std. dev.) in synchronous commits	ΔW (mean ± std. dev.) in non-synchronous commits	U	p-value	Cliff's Delta
feature
1	110.19 ± 256.55	189.86 ± 1,097.23	582280	0.2201	-0.0286
5	253.02 ± 1,470.09	174.73 ± 989.08	1496200	0.7474	0.0157
10	263.91 ± 1,403.50	166.38 ± 969.15	2024200	0.9826	0.0435
15	259.48 ± 1,335.70	163.93 ± 976.87	2215300	0.9452	0.0313
file
1	127.37 ± 362.98	140.35 ± 982.85	4655300	0.9999	0.0736
5	165.36 ± 1,366.64	132.18 ± 792.28	9114400	1	0.0823
10	162.38 ± 1,179.11	129.25 ± 820.71	11488000	1	0.0790
15	172.25 ± 1,222.08	120.15 ± 734.43	12707000	1	0.0786

**Hypothesis H1.4.** One-tailed Mann-Whitney U test for comparing code effort ΔW of synchronous and non-synchronous commits for OpenSSL. A p-value < 0.05 means that the code effort ΔW in synchronous commits is significantly lower than the code effort ΔW in non-synchronous commits.
OpenSSL
ξ	ΔW (mean ± std. dev.) in synchronous commits	ΔW (mean ± std. dev.) in non-synchronous commits	U	p-value	Cliff's Delta
feature
1	706.69 ± 14,256.52	121.61 ± 1,391.34	1335300	1	0.1164
5	400.57 ± 9,638.69	114.93 ± 1,560.62	2227900	1	0.1030
10	315.29 ± 8,170.89	119.53 ± 1,749.45	2440600	1	0.1060
15	333.69 ± 7,722.08	54.91 ± 147.70	2422700	1	0.1240
file
1	743.60 ± 16,919.80	107.34 ± 1,326.78	3667200	1	0.1533
5	382.26 ± 10,903.57	102.87 ± 1,432.18	7201600	1	0.1507
10	369.61 ± 9,619.07	67.96 ± 242.24	8425100	1	0.1570
15	323.46 ± 8,825.31	66.63 ± 242.61	8851400	1	0.1523

**Hypothesis H1.4.** One-tailed Mann-Whitney U test for comparing code effort ΔW of synchronous and non-synchronous commits for QEMU. A p-value < 0.05 means that the code effort ΔW in synchronous commits is significantly lower than the code effort ΔW in non-synchronous commits.
QEMU
ξ	ΔW (mean ± std. dev.) in synchronous commits	ΔW (mean ± std. dev.) in non-synchronous commits	U	p-value	Cliff's Delta
feature
1	118.55 ± 500.31	130.35 ± 438.24	6727800	0.005738	-0.0359
5	109.04 ± 423.50	146.46 ± 486.60	8453900	1.358e-05	-0.0527
10	107.70 ± 401.94	158.62 ± 529.27	7941400	2.102e-06	-0.0594
15	109.36 ± 400.10	162.00 ± 546.24	7518300	2.157e-06	-0.0609
file
1	87.90 ± 764.97	74.34 ± 318.52	99455000	1	0.0858
5	78.64 ± 563.80	75.55 ± 326.28	158850000	1	0.0666
10	75.28 ± 493.61	78.33 ± 349.17	166410000	1	0.0533
15	72.55 ± 460.57	83.60 ± 378.55	155570000	1	0.0448

Hypothesis H3.1

(Jump to results of H3.1 using smoothing parameter and band-window size 1ξ | 2ξ | 3ξ)

Smoothing parameter and band-window size: 1ξ

(Jump to data for BusyBox | OpenSSL | QEMU)

**Hypothesis H3.1.** One-tailed Mann-Whitney U test for comparing DTW distances for empirical and simulated bursts for BusyBox. A p-value < 0.05 means that the empirical DTW distances are significantly smaller than the simulated DTW distances. Smoothing parameter and band-window size: 1ξ
BusyBox
ξ	mailing-list abstraction	mean ± std. dev. of empirical DTW distances	mean ± std. dev. of simulated DTW distances	U	p-value	Cliff's Delta
feature • lower-bound approach
1	message	0.25 ± 0.19	0.55 ± 0.32	31.5	0.02809	-0.6613
5	message	0.12 ± 0.09	0.80 ± 0.41	12	8.058e-07	-0.9917
10	message	0.38 ± 1.01	0.87 ± 0.69	1079	3.432e-06	-0.7563
15	message	0.37 ± 0.88	0.94 ± 0.83	2369	2.147e-06	-0.6906
1	conversation	0.55 ± 0.27	0.66 ± 0.38	61	0.356	-0.1348
5	conversation	0.11 ± 0.07	0.80 ± 0.42	5	8.594e-07	-0.9949
10	conversation	0.42 ± 1.05	0.89 ± 0.64	491	2.697e-06	-0.8025
15	conversation	0.40 ± 0.91	0.97 ± 0.83	1190	1.547e-06	-0.7284
file • lower-bound approach
1	message	0.43 ± 0.32	0.88 ± 0.72	1369	0.004908	-0.4074
5	message	0.72 ± 1.25	1.90 ± 2.40	8089	2.363e-06	-0.5236
10	message	1.13 ± 2.94	1.90 ± 2.80	21368	7.74e-07	-0.4726
15	message	0.88 ± 2.19	1.95 ± 2.95	27716	8.219e-09	-0.5254
1	conversation	0.43 ± 0.38	1.07 ± 0.84	789.5	0.0007932	-0.5031
5	conversation	0.73 ± 1.29	2.47 ± 2.82	4256	8.945e-08	-0.6008
10	conversation	0.76 ± 1.38	2.64 ± 3.33	7673	8.53e-09	-0.6016
15	conversation	0.63 ± 1.08	2.67 ± 3.52	9958.5	1.836e-11	-0.6528
feature • upper-bound approach
1	message	6.74 ± 11.50	12.14 ± 17.06	23419	0.2708	-0.0573
5	message	13.31 ± 14.91	15.51 ± 18.25	109990	0.321	-0.0341
10	message	11.33 ± 13.44	11.97 ± 14.50	199510	0.3757	-0.0213
15	message	9.25 ± 10.89	9.98 ± 12.01	266430	0.2336	-0.0464
1	conversation	8.27 ± 11.70	17.53 ± 17.45	6978	0.001603	-0.3044
5	conversation	9.47 ± 12.11	18.89 ± 19.64	28781	1.49e-05	-0.3254
10	conversation	8.73 ± 11.90	16.52 ± 17.73	45411	4.634e-06	-0.3249
15	conversation	7.18 ± 9.64	14.65 ± 15.98	58123	6.51e-07	-0.3386
file • upper-bound approach
1	message	9.01 ± 14.76	10.51 ± 15.62	150470	0.4657	-0.0055
5	message	11.32 ± 14.89	12.78 ± 16.69	459040	0.06565	-0.0821
10	message	8.56 ± 11.70	9.43 ± 12.63	792920	0.01302	-0.1108
15	message	6.92 ± 9.38	7.77 ± 10.21	1052700	0.001046	-0.1448
1	conversation	7.54 ± 11.13	14.81 ± 17.31	43772	3.587e-05	-0.2709
5	conversation	7.91 ± 12.51	15.03 ± 18.24	114840	7.036e-09	-0.3292
10	conversation	6.37 ± 10.08	12.17 ± 15.47	175430	1.313e-09	-0.3260
15	conversation	5.08 ± 8.11	10.45 ± 13.54	220680	1.603e-11	-0.3454

**Hypothesis H3.1.** One-tailed Mann-Whitney U test for comparing DTW distances for empirical and simulated bursts for OpenSSL. A p-value < 0.05 means that the empirical DTW distances are significantly smaller than the simulated DTW distances. Smoothing parameter and band-window size: 1ξ
OpenSSL
ξ	mailing-list abstraction	mean ± std. dev. of empirical DTW distances	mean ± std. dev. of simulated DTW distances	U	p-value	Cliff's Delta
feature • lower-bound approach
1	message	-- ± --	-- ± --	--	--	--
5	message	1.49 ± --	1.24 ± 0.94	95	0.8289	0.5447
10	message	1.39 ± 1.08	1.53 ± 1.30	328	0.5391	0.0314
15	message	1.11 ± 1.07	1.69 ± 1.69	333	0.2536	-0.2238
1	conversation	-- ± --	-- ± --	--	--	--
5	conversation	-- ± --	-- ± --	--	--	--
10	conversation	-- ± --	-- ± --	--	--	--
15	conversation	-- ± --	-- ± --	--	--	--
file • lower-bound approach
1	message	0.51 ± --	0.74 ± 0.28	6	0.1481	-0.6471
5	message	0.57 ± 0.57	1.18 ± 0.72	104	0.156	-0.4190
10	message	1.00 ± 0.71	1.48 ± 1.02	462	0.1607	-0.2892
15	message	0.89 ± 0.72	1.56 ± 1.13	738	0.09502	-0.3411
1	conversation	-- ± --	-- ± --	--	--	--
5	conversation	1.14 ± --	1.11 ± 0.53	27	0.7179	0.4211
10	conversation	1.17 ± --	1.41 ± 0.75	28	0.477	-0.0508
15	conversation	1.36 ± --	1.44 ± 0.73	44	0.5671	0.0864
feature • upper-bound approach
1	message	6.63 ± 8.34	9.46 ± 12.61	168510	0.1669	-0.0696
5	message	9.77 ± 12.15	11.72 ± 14.87	739630	0.1316	-0.0579
10	message	7.98 ± 9.89	9.34 ± 11.90	1266200	0.1006	-0.0590
15	message	6.82 ± 8.30	7.88 ± 9.90	1805200	0.1412	-0.0456
1	conversation	3.42 ± 3.51	3.60 ± 3.78	58068	0.3198	-0.0430
5	conversation	4.52 ± 4.54	5.12 ± 5.11	191410	0.1981	-0.0593
10	conversation	3.97 ± 4.57	4.76 ± 5.08	284320	0.0441	-0.1086
15	conversation	3.45 ± 4.23	4.56 ± 4.93	351890	0.001909	-0.1705
file • upper-bound approach
1	message	6.79 ± 8.56	9.20 ± 12.30	206560	0.3156	-0.0333
5	message	8.14 ± 11.32	10.83 ± 14.20	1012300	0.004428	-0.1191
10	message	6.68 ± 9.11	8.61 ± 11.42	1765600	0.006031	-0.1023
15	message	5.77 ± 7.50	7.16 ± 9.36	2620000	0.01207	-0.0845
1	conversation	3.69 ± 3.66	3.80 ± 3.65	71590	0.2943	-0.0464
5	conversation	4.30 ± 4.81	5.15 ± 5.31	231950	0.02911	-0.1215
10	conversation	3.53 ± 4.67	4.86 ± 5.35	341420	0.0001486	-0.2071
15	conversation	3.25 ± 4.42	4.59 ± 5.23	445400	3.199e-05	-0.2147

**Hypothesis H3.1.** One-tailed Mann-Whitney U test for comparing DTW distances for empirical and simulated bursts for QEMU. A p-value < 0.05 means that the empirical DTW distances are significantly smaller than the simulated DTW distances. Smoothing parameter and band-window size: 1ξ
QEMU
ξ	mailing-list abstraction	mean ± std. dev. of empirical DTW distances	mean ± std. dev. of simulated DTW distances	U	p-value	Cliff's Delta
feature • lower-bound approach
1	message	0.25 ± 0.28	0.45 ± 0.42	13366	1.36e-08	-0.3333
5	message	0.31 ± 0.47	1.20 ± 1.28	69318	< 2.2e-16	-0.7585
10	message	0.30 ± 0.51	1.38 ± 1.51	186200	< 2.2e-16	-0.7823
15	message	0.25 ± 0.45	1.43 ± 1.56	288960	< 2.2e-16	-0.8110
1	conversation	0.44 ± 0.42	0.59 ± 0.49	4628.5	0.004488	-0.2057
5	conversation	0.86 ± 1.00	1.52 ± 1.52	24226	6.055e-16	-0.3929
10	conversation	0.88 ± 1.10	1.77 ± 1.83	51951	< 2.2e-16	-0.4394
15	conversation	0.86 ± 1.12	1.90 ± 1.93	73603	< 2.2e-16	-0.4855
file • lower-bound approach
1	message	0.29 ± 0.34	0.52 ± 0.52	92955	< 2.2e-16	-0.3160
5	message	0.50 ± 0.79	1.65 ± 2.10	437570	< 2.2e-16	-0.6608
10	message	0.43 ± 0.79	1.89 ± 2.44	866550	< 2.2e-16	-0.7391
15	message	0.37 ± 0.69	1.91 ± 2.39	1278900	< 2.2e-16	-0.7704
1	conversation	0.52 ± 0.55	0.74 ± 0.63	43613	5.402e-09	-0.2529
5	conversation	1.07 ± 1.30	2.01 ± 2.38	220690	2.2e-16	-0.3936
10	conversation	1.07 ± 1.43	2.34 ± 2.81	424970	< 2.2e-16	-0.4470
15	conversation	1.05 ± 1.45	2.39 ± 2.78	605500	< 2.2e-16	-0.4708
feature • upper-bound approach
1	message	10.66 ± 18.13	8.70 ± 15.16	376520	0.998	0.0836
5	message	15.65 ± 15.90	19.40 ± 20.44	2548600	1.796e-06	-0.0811
10	message	11.80 ± 11.30	14.50 ± 14.64	6174500	2.218e-09	-0.0821
15	message	9.40 ± 8.88	11.83 ± 11.65	9783500	< 2.2e-16	-0.1013
1	conversation	7.15 ± 14.36	8.49 ± 15.43	137800	0.02016	-0.0734
5	conversation	9.44 ± 15.92	10.91 ± 18.81	894930	0.0106	-0.0524
10	conversation	7.09 ± 11.63	8.60 ± 14.55	1839100	4.059e-05	-0.0743
15	conversation	5.80 ± 9.16	7.38 ± 11.87	2566300	1.468e-10	-0.1083
file • upper-bound approach
1	message	10.52 ± 16.85	8.92 ± 14.64	1118600	0.9975	0.0608
5	message	15.45 ± 15.18	20.02 ± 20.04	6038900	< 2.2e-16	-0.1127
10	message	11.12 ± 10.60	14.74 ± 14.26	13431000	< 2.2e-16	-0.1306
15	message	8.85 ± 8.22	11.75 ± 11.12	21168000	< 2.2e-16	-0.1365
1	conversation	6.39 ± 12.13	8.77 ± 15.89	556040	0.0001101	-0.0912
5	conversation	7.67 ± 12.94	9.92 ± 16.73	2976900	9.035e-10	-0.0989
10	conversation	5.22 ± 8.76	7.42 ± 12.11	5537900	< 2.2e-16	-0.1569
15	conversation	4.03 ± 6.53	6.05 ± 9.51	7637000	< 2.2e-16	-0.1863

Smoothing parameter and band-window size: 2ξ

(Jump to data for BusyBox | OpenSSL | QEMU)

**Hypothesis H3.1.** One-tailed Mann-Whitney U test for comparing DTW distances for empirical and simulated bursts for BusyBox. A p-value < 0.05 means that the empirical DTW distances are significantly smaller than the simulated DTW distances. Smoothing parameter and band-window size: 2ξ
BusyBox
ξ	mailing-list abstraction	mean ± std. dev. of empirical DTW distances	mean ± std. dev. of simulated DTW distances	U	p-value	Cliff's Delta
feature • lower-bound approach
1	message	0.15 ± 0.13	0.56 ± 0.21	9	0.004511	-0.9032
5	message	0.06 ± 0.08	0.67 ± 0.39	10	7.792e-07	-0.9931
10	message	0.26 ± 0.70	0.72 ± 0.61	1090.5	3.691e-06	-0.7537
15	message	0.25 ± 0.64	0.76 ± 0.72	1897.5	2.772e-07	-0.7522
1	conversation	0.14 ± 0.13	0.58 ± 0.20	6	0.00445	-0.9149
5	conversation	0.06 ± 0.08	0.66 ± 0.38	11	9.953e-07	-0.9887
10	conversation	0.28 ± 0.72	0.74 ± 0.57	515	3.493e-06	-0.7928
15	conversation	0.26 ± 0.66	0.78 ± 0.73	1006	4.04e-07	-0.7704
file • lower-bound approach
1	message	0.43 ± 0.38	1.11 ± 1.03	1080	0.0003705	-0.5325
5	message	0.56 ± 1.02	1.60 ± 2.11	7514	5.504e-07	-0.5574
10	message	0.85 ± 2.27	1.53 ± 2.34	21183	6.131e-07	-0.4771
15	message	0.66 ± 1.75	1.53 ± 2.39	25236	5.234e-10	-0.5679
1	conversation	0.43 ± 0.39	1.33 ± 1.15	615.5	6.06e-05	-0.6126
5	conversation	0.56 ± 1.06	2.08 ± 2.49	4044	3.467e-08	-0.6206
10	conversation	0.56 ± 0.89	2.13 ± 2.78	7878.5	1.517e-08	-0.5909
15	conversation	0.44 ± 0.82	2.11 ± 2.85	8823	1.125e-12	-0.6924
feature • upper-bound approach
1	message	11.24 ± 16.02	20.90 ± 23.85	19416	0.009991	-0.2185
5	message	10.32 ± 11.54	12.77 ± 14.81	105930	0.1709	-0.0697
10	message	7.74 ± 9.16	8.78 ± 10.30	190510	0.1648	-0.0654
15	message	5.89 ± 6.94	6.84 ± 7.89	251590	0.05942	-0.0995
1	conversation	8.93 ± 12.08	18.79 ± 18.55	6776.5	0.0008398	-0.3245
5	conversation	7.77 ± 9.86	15.76 ± 16.14	28038	5.451e-06	-0.3428
10	conversation	6.67 ± 8.91	13.11 ± 13.77	43935	1.106e-06	-0.3468
15	conversation	5.31 ± 7.17	11.06 ± 11.80	56581	1.789e-07	-0.3562
file • upper-bound approach
1	message	12.51 ± 17.60	17.86 ± 22.42	127270	0.006106	-0.1588
5	message	8.64 ± 10.47	11.35 ± 13.50	438340	0.01162	-0.1235
10	message	5.81 ± 8.01	6.94 ± 9.03	749250	0.000664	-0.1597
15	message	4.40 ± 6.01	5.39 ± 6.79	989630	1.55e-05	-0.1960
1	conversation	9.42 ± 14.18	16.12 ± 18.44	43538	2.816e-05	-0.2748
5	conversation	6.31 ± 9.88	12.42 ± 14.94	112770	2.04e-09	-0.3413
10	conversation	4.77 ± 7.58	9.54 ± 11.95	169080	7.836e-11	-0.3503
15	conversation	3.71 ± 5.99	7.81 ± 9.95	212860	7.075e-13	-0.3686

**Hypothesis H3.1.** One-tailed Mann-Whitney U test for comparing DTW distances for empirical and simulated bursts for OpenSSL. A p-value < 0.05 means that the empirical DTW distances are significantly smaller than the simulated DTW distances. Smoothing parameter and band-window size: 2ξ
OpenSSL
ξ	mailing-list abstraction	mean ± std. dev. of empirical DTW distances	mean ± std. dev. of simulated DTW distances	U	p-value	Cliff's Delta
feature • lower-bound approach
1	message	-- ± --	-- ± --	--	--	--
5	message	1.42 ± --	1.16 ± 0.91	93	0.8143	0.5122
10	message	1.48 ± 1.10	1.37 ± 1.20	361	0.6578	0.1352
15	message	0.94 ± 0.91	1.49 ± 1.49	336	0.2603	-0.2168
1	conversation	-- ± --	-- ± --	--	--	--
5	conversation	-- ± --	-- ± --	--	--	--
10	conversation	-- ± --	-- ± --	--	--	--
15	conversation	-- ± --	-- ± --	--	--	--
file • lower-bound approach
1	message	0.03 ± --	0.67 ± 0.27	0	0.05063	-1.0000
5	message	0.55 ± 0.55	0.99 ± 0.63	117	0.202	-0.3464
10	message	0.86 ± 0.69	1.18 ± 0.88	502	0.2177	-0.2277
15	message	0.57 ± 0.49	1.22 ± 0.95	547	0.02462	-0.5116
1	conversation	-- ± --	-- ± --	--	--	--
5	conversation	1.11 ± --	0.96 ± 0.46	29	0.7692	0.5263
10	conversation	1.18 ± --	1.18 ± 0.67	35	0.6355	0.1864
15	conversation	1.09 ± --	1.15 ± 0.54	41	0.5168	0.0123
feature • upper-bound approach
1	message	9.65 ± 10.98	14.65 ± 16.33	153510	0.01714	-0.1525
5	message	9.99 ± 7.72	12.60 ± 9.50	717970	0.04919	-0.0854
10	message	5.61 ± 6.91	7.28 ± 9.10	1219600	0.02131	-0.0936
15	message	4.53 ± 5.50	5.80 ± 7.02	1721300	0.01697	-0.0900
1	conversation	4.55 ± 4.20	4.04 ± 4.03	64523	0.7547	0.0633
5	conversation	3.80 ± 3.93	4.46 ± 4.42	185520	0.1033	-0.0883
10	conversation	3.17 ± 3.79	3.98 ± 4.20	274710	0.01472	-0.1388
15	conversation	2.74 ± 3.51	3.68 ± 3.89	342900	0.0005729	-0.1917
file • upper-bound approach
1	message	9.55 ± 10.96	13.97 ± 15.97	186460	0.03323	-0.1274
5	message	6.36 ± 8.73	9.17 ± 12.00	983850	0.000782	-0.1438
10	message	4.63 ± 6.26	6.66 ± 8.69	1689600	0.0002704	-0.1409
15	message	3.79 ± 4.95	5.22 ± 6.58	2488300	0.000248	-0.1305
1	conversation	4.62 ± 4.38	4.20 ± 3.94	76566	0.5918	0.0199
5	conversation	3.48 ± 3.91	4.39 ± 4.52	227090	0.0146	-0.1399
10	conversation	2.78 ± 3.75	3.99 ± 4.35	332640	3.53e-05	-0.2275
15	conversation	2.44 ± 3.49	3.63 ± 4.07	420070	6.842e-07	-0.2593

**Hypothesis H3.1.** One-tailed Mann-Whitney U test for comparing DTW distances for empirical and simulated bursts for QEMU. A p-value < 0.05 means that the empirical DTW distances are significantly smaller than the simulated DTW distances. Smoothing parameter and band-window size: 2ξ
QEMU
ξ	mailing-list abstraction	mean ± std. dev. of empirical DTW distances	mean ± std. dev. of simulated DTW distances	U	p-value	Cliff's Delta
feature • lower-bound approach
1	message	0.29 ± 0.36	0.65 ± 0.51	7842	< 2.2e-16	-0.6088
5	message	0.20 ± 0.35	1.10 ± 1.24	52735	< 2.2e-16	-0.8162
10	message	0.16 ± 0.33	1.16 ± 1.30	130300	< 2.2e-16	-0.8477
15	message	0.13 ± 0.28	1.14 ± 1.28	226660	< 2.2e-16	-0.8518
1	conversation	0.66 ± 0.66	0.79 ± 0.60	4543	0.002579	-0.2204
5	conversation	0.66 ± 0.85	1.37 ± 1.44	22011	< 2.2e-16	-0.4484
10	conversation	0.62 ± 0.83	1.51 ± 1.55	43588	< 2.2e-16	-0.5296
15	conversation	0.61 ± 0.87	1.56 ± 1.58	65972	< 2.2e-16	-0.5388
file • lower-bound approach
1	message	0.40 ± 0.48	0.89 ± 0.84	60276	< 2.2e-16	-0.5565
5	message	0.30 ± 0.56	1.46 ± 1.91	327120	< 2.2e-16	-0.7464
10	message	0.22 ± 0.46	1.53 ± 1.93	656910	< 2.2e-16	-0.8023
15	message	0.17 ± 0.39	1.45 ± 1.77	950620	< 2.2e-16	-0.8293
1	conversation	0.72 ± 0.73	1.05 ± 0.90	40476	2.124e-12	-0.3066
5	conversation	0.80 ± 1.05	1.74 ± 2.12	201110	< 2.2e-16	-0.4474
10	conversation	0.73 ± 1.05	1.88 ± 2.20	375170	< 2.2e-16	-0.5118
15	conversation	0.68 ± 1.01	1.83 ± 2.07	530830	< 2.2e-16	-0.5361
feature • upper-bound approach
1	message	20.97 ± 26.46	28.03 ± 32.23	300040	1.34e-06	-0.1365
5	message	12.61 ± 12.51	16.11 ± 16.27	2482000	9.51e-10	-0.1051
10	message	8.36 ± 7.89	10.51 ± 9.98	5960800	< 2.2e-16	-0.1139
15	message	6.19 ± 5.82	7.98 ± 7.36	9399200	< 2.2e-16	-0.1366
1	conversation	11.44 ± 20.58	13.90 ± 22.75	135800	0.007634	-0.0868
5	conversation	7.85 ± 12.93	9.23 ± 15.42	884480	0.00263	-0.0635
10	conversation	5.44 ± 8.50	6.83 ± 10.90	1794100	1.35e-07	-0.0969
15	conversation	4.17 ± 6.20	5.54 ± 8.25	2480600	4.747e-16	-0.1381
file • upper-bound approach
1	message	21.41 ± 25.35	30.52 ± 32.38	881640	1.634e-14	-0.1639
5	message	12.31 ± 11.86	16.49 ± 15.86	5858400	< 2.2e-16	-0.1392
10	message	7.77 ± 7.30	10.55 ± 9.61	12954000	< 2.2e-16	-0.1614
15	message	5.74 ± 5.32	7.82 ± 6.93	20268000	< 2.2e-16	-0.1732
1	conversation	10.22 ± 17.17	14.95 ± 23.59	532990	8.911e-08	-0.1289
5	conversation	6.27 ± 10.29	8.32 ± 13.54	2927900	2.34e-12	-0.1138
10	conversation	3.88 ± 6.17	5.80 ± 8.83	5349600	2.2e-16	-0.1856
15	conversation	2.79 ± 4.23	4.46 ± 6.41	7273000	< 2.2e-16	-0.2251

Smoothing parameter and band-window size: 3ξ

(Jump to data for BusyBox | OpenSSL | QEMU)

**Hypothesis H3.1.** One-tailed Mann-Whitney U test for comparing DTW distances for empirical and simulated bursts for BusyBox. A p-value < 0.05 means that the empirical DTW distances are significantly smaller than the simulated DTW distances. Smoothing parameter and band-window size: 3ξ
BusyBox
ξ	mailing-list abstraction	mean ± std. dev. of empirical DTW distances	mean ± std. dev. of simulated DTW distances	U	p-value	Cliff's Delta
feature • lower-bound approach
1	message	0.06 ± 0.04	0.47 ± 0.16	0	0.001911	-1.0000
5	message	0.05 ± 0.08	0.62 ± 0.39	14	8.333e-07	-0.9903
10	message	0.22 ± 0.59	0.65 ± 0.58	1177	6.321e-06	-0.7342
15	message	0.21 ± 0.54	0.68 ± 0.65	2144	8.25e-07	-0.7200
1	conversation	0.06 ± 0.04	0.48 ± 0.16	0	0.002111	-1.0000
5	conversation	0.05 ± 0.08	0.62 ± 0.37	14.5	1.084e-06	-0.9851
10	conversation	0.24 ± 0.62	0.67 ± 0.55	583	7.156e-06	-0.7655
15	conversation	0.23 ± 0.56	0.71 ± 0.69	1284	2.99e-06	-0.7070
file • lower-bound approach
1	message	0.35 ± 0.35	0.98 ± 0.97	1111	0.0005033	-0.5190
5	message	0.51 ± 0.90	1.46 ± 1.94	7544	5.951e-07	-0.5557
10	message	0.73 ± 1.93	1.36 ± 2.07	19760	9.511e-08	-0.5123
15	message	0.59 ± 1.59	1.34 ± 2.08	24325	1.81e-10	-0.5835
1	conversation	0.37 ± 0.34	1.18 ± 1.07	634	8.147e-05	-0.6010
5	conversation	0.52 ± 0.92	1.90 ± 2.29	4121	4.908e-08	-0.6134
10	conversation	0.49 ± 0.87	1.89 ± 2.47	7063	1.457e-09	-0.6333
15	conversation	0.39 ± 0.78	1.84 ± 2.49	8829.5	1.143e-12	-0.6922
feature • upper-bound approach
1	message	9.78 ± 13.93	18.42 ± 20.98	19317	0.00892	-0.2224
5	message	9.15 ± 10.53	11.72 ± 13.71	104260	0.1249	-0.0844
10	message	6.54 ± 7.76	7.73 ± 9.17	186130	0.09768	-0.0869
15	message	4.79 ± 5.71	5.87 ± 6.82	245590	0.02895	-0.1209
1	conversation	7.35 ± 10.01	15.75 ± 15.23	6521.5	0.0003522	-0.3499
5	conversation	6.69 ± 8.37	14.09 ± 14.26	26922	1.1e-06	-0.3690
10	conversation	5.47 ± 7.30	11.13 ± 11.51	42819	3.534e-07	-0.3634
15	conversation	4.30 ± 5.79	9.06 ± 9.42	55525	7.129e-08	-0.3682
file • upper-bound approach
1	message	10.61 ± 14.88	15.67 ± 19.66	125580	0.003659	-0.1700
5	message	7.53 ± 10.08	9.52 ± 12.38	429660	0.00482	-0.1409
10	message	4.84 ± 6.78	6.06 ± 7.99	721850	6.48e-05	-0.1905
15	message	3.49 ± 4.83	4.59 ± 5.86	963810	1.997e-06	-0.2170
1	conversation	7.64 ± 11.46	13.50 ± 15.19	42628	1.07e-05	-0.2900
5	conversation	5.34 ± 8.20	11.03 ± 13.12	110460	4.858e-10	-0.3548
10	conversation	3.91 ± 6.21	8.07 ± 9.95	163490	5.552e-12	-0.3718
15	conversation	2.96 ± 4.75	6.41 ± 8.02	210340	2.481e-13	-0.3761

**Hypothesis H3.1.** One-tailed Mann-Whitney U test for comparing DTW distances for empirical and simulated bursts for OpenSSL. A p-value < 0.05 means that the empirical DTW distances are significantly smaller than the simulated DTW distances. Smoothing parameter and band-window size: 3ξ
OpenSSL
ξ	mailing-list abstraction	mean ± std. dev. of empirical DTW distances	mean ± std. dev. of simulated DTW distances	U	p-value	Cliff's Delta
feature • lower-bound approach
1	message	-- ± --	-- ± --	--	--	--
5	message	1.61 ± --	1.12 ± 0.89	103	0.8797	0.6748
10	message	1.29 ± 0.97	1.28 ± 1.14	346	0.605	0.0881
15	message	0.83 ± 0.78	1.37 ± 1.39	323	0.2319	-0.2471
1	conversation	-- ± --	-- ± --	--	--	--
5	conversation	-- ± --	-- ± --	--	--	--
10	conversation	-- ± --	-- ± --	--	--	--
15	conversation	-- ± --	-- ± --	--	--	--
file • lower-bound approach
1	message	0.00 ± --	0.56 ± 0.20	0	0.05094	-1.0000
5	message	0.53 ± 0.53	0.90 ± 0.60	122	0.2216	-0.3184
10	message	0.70 ± 0.58	1.04 ± 0.79	453	0.1493	-0.3031
15	message	0.52 ± 0.46	1.04 ± 0.84	584	0.03292	-0.4786
1	conversation	-- ± --	-- ± --	--	--	--
5	conversation	1.06 ± --	0.89 ± 0.48	31	0.8205	0.6316
10	conversation	1.02 ± --	1.04 ± 0.54	32	0.5688	0.0847
15	conversation	1.13 ± --	1.01 ± 0.47	53	0.7086	0.3086
feature • upper-bound approach
1	message	8.31 ± 9.27	12.61 ± 13.89	153900	0.01845	-0.1503
5	message	6.83 ± 8.41	9.05 ± 11.33	706680	0.02675	-0.0998
10	message	4.72 ± 5.80	6.34 ± 7.83	1197400	0.008515	-0.1101
15	message	3.79 ± 4.62	4.98 ± 5.96	1689900	0.005997	-0.1066
1	conversation	4.05 ± 3.63	3.61 ± 3.47	64187	0.7353	0.0578
5	conversation	4.13 ± 3.39	4.04 ± 3.53	181240	0.05902	-0.1093
10	conversation	2.86 ± 3.48	3.55 ± 3.66	271270	0.009464	-0.1495
15	conversation	2.35 ± 3.15	3.22 ± 3.31	329440	7.515e-05	-0.2234
file • upper-bound approach
1	message	8.20 ± 9.26	11.99 ± 13.58	186460	0.03326	-0.1274
5	message	5.52 ± 7.65	8.28 ± 10.80	961030	0.0001596	-0.1637
10	message	3.86 ± 5.27	5.78 ± 7.47	1645000	2.962e-05	-0.1636
15	message	3.11 ± 4.07	4.46 ± 5.55	2419400	1.848e-05	-0.1545
1	conversation	4.03 ± 3.82	3.68 ± 3.40	76082	0.5624	0.0135
5	conversation	3.10 ± 3.49	4.02 ± 4.10	223210	0.007976	-0.1546
10	conversation	2.39 ± 3.31	3.52 ± 3.77	319020	2.998e-06	-0.2591
15	conversation	2.04 ± 3.02	3.14 ± 3.45	400920	2.407e-08	-0.2931

**Hypothesis H3.1.** One-tailed Mann-Whitney U test for comparing DTW distances for empirical and simulated bursts for QEMU. A p-value < 0.05 means that the empirical DTW distances are significantly smaller than the simulated DTW distances. Smoothing parameter and band-window size: 3ξ
QEMU
ξ	mailing-list abstraction	mean ± std. dev. of empirical DTW distances	mean ± std. dev. of simulated DTW distances	U	p-value	Cliff's Delta
feature • lower-bound approach
1	message	0.26 ± 0.38	0.65 ± 0.52	7026	< 2.2e-16	-0.6495
5	message	0.15 ± 0.29	1.04 ± 1.17	45847	< 2.2e-16	-0.8402
10	message	0.11 ± 0.27	1.04 ± 1.14	118750	< 2.2e-16	-0.8612
15	message	0.09 ± 0.22	1.00 ± 1.11	221620	< 2.2e-16	-0.8551
1	conversation	0.60 ± 0.65	0.75 ± 0.59	4427.5	0.00115	-0.2402
5	conversation	0.55 ± 0.76	1.29 ± 1.34	19858	< 2.2e-16	-0.5023
10	conversation	0.52 ± 0.75	1.36 ± 1.37	40950	< 2.2e-16	-0.5581
15	conversation	0.49 ± 0.71	1.37 ± 1.37	61506	< 2.2e-16	-0.5700
file • lower-bound approach
1	message	0.38 ± 0.49	0.86 ± 0.85	59664	< 2.2e-16	-0.5610
5	message	0.22 ± 0.45	1.34 ± 1.76	279790	< 2.2e-16	-0.7831
10	message	0.15 ± 0.36	1.35 ± 1.67	570080	< 2.2e-16	-0.8284
15	message	0.11 ± 0.28	1.24 ± 1.48	854640	< 2.2e-16	-0.8466
1	conversation	0.66 ± 0.70	0.95 ± 0.86	40636	3.276e-12	-0.3039
5	conversation	0.66 ± 0.92	1.58 ± 1.90	185550	< 2.2e-16	-0.4901
10	conversation	0.59 ± 0.88	1.65 ± 1.88	348860	< 2.2e-16	-0.5460
15	conversation	0.53 ± 0.84	1.56 ± 1.71	494030	< 2.2e-16	-0.5682
feature • upper-bound approach
1	message	21.35 ± 25.74	30.48 ± 34.41	295000	1.039e-07	-0.1510
5	message	11.37 ± 11.26	15.04 ± 15.19	2434000	1.333e-12	-0.1224
10	message	7.04 ± 6.66	9.16 ± 8.58	5810100	< 2.2e-16	-0.1363
15	message	4.95 ± 4.67	6.68 ± 6.09	9085400	< 2.2e-16	-0.1654
1	conversation	10.36 ± 18.17	12.21 ± 19.66	137750	0.0197	-0.0737
5	conversation	6.82 ± 10.85	8.22 ± 13.20	871040	0.0003181	-0.0777
10	conversation	4.45 ± 6.63	5.75 ± 8.64	1744500	4.952e-11	-0.1219
15	conversation	3.27 ± 4.63	4.53 ± 6.31	2405800	< 2.2e-16	-0.1641
file • upper-bound approach
1	message	22.23 ± 25.10	33.60 ± 35.03	866330	< 2.2e-16	-0.1785
5	message	11.03 ± 10.64	15.36 ± 14.77	5732400	< 2.2e-16	-0.1577
10	message	6.51 ± 6.13	9.15 ± 8.25	12612000	< 2.2e-16	-0.1836
15	message	4.55 ± 4.26	6.51 ± 5.70	19533000	< 2.2e-16	-0.2032
1	conversation	9.27 ± 15.23	13.11 ± 20.43	541740	1.734e-06	-0.1146
5	conversation	5.39 ± 8.51	7.37 ± 11.48	2871600	9.239e-16	-0.1308
10	conversation	3.12 ± 4.73	4.86 ± 6.94	5174200	< 2.2e-16	-0.2123
15	conversation	2.14 ± 3.09	3.63 ± 4.87	6975000	< 2.2e-16	-0.2569