detection_results.txt
Using Gutenberg data to build Language Vectors, testing on Europarl:
n=10000, k=5000:
0.7026, cluster 1, unordered - var 0.00770797418387
0.7015, cluster 1, ordered - var 0.00733196525569
0.8267, cluster 2, unordered - var 0.0146911988655
0.8739, cluster 2, ordered - var 0.0175363099891
0.8403, cluster 3, unordered - var 0.014884196191
0.8972, cluster 3, ordered - var 0.0152095671908
0.8109, cluster 4, unordered - var 0.0132731596119
0.8581, cluster 4, ordered - var 0.00881416457223
0.7662, cluster 5, unordered - var 0.0113622425598
0.7243, cluster 5, ordered - var 0.0103926946532
0.79, using clusters 1 to 3, unordered
0.798, using clusters 1 to 3, ordered
0.8902, clusters 2 to 3, ordered
0.8448, clusters 2 to 3, unordered
n=10000, k=500
0.700578947368 cluster 1, ordered - var 0.00772166929846
0.494368421053 cluster 2, ordered - var 0.0187395413045
0.205105263158 cluster 3, ordered - var 0.0158221406892
0.052263157895 cluster 4, ordered - var 0.00929805220971
0.051947368421 cluster 5, ordered - var 0.00970820506111
N=1000, k=10, unordered
cluster size 1: 0.702473684211 variance= 0.0068957747879
cluster size 2: 0.580684210526 variance= 0.0240669001062
cluster size 3: 0.148894736842 variance= 0.0917759523055
cluster size 4: 0.0526315789474 variance= nan
cluster size 5: 0.0526315789474 variance= nan
N=1000, k=10, ordered
cluster size 1: 0.702473684211 variance= 0.0068957747879
cluster size 2: 0.525947368421 variance= 0.028626262122
cluster size 3: 0.134578947368 variance= 0.0876637231551
cluster size 4: 0.0526842105263 variance= nan
cluster size 5: 0.0526315789474 variance= nan
N=1000, k=100, unordered
cluster size 1: 0.701 variance= 0.00766485002661
cluster size 2: 0.815789473684 variance= 0.0137856081087
cluster size 3: 0.740368421053 variance= 0.0161201758345
cluster size 4: 0.470368421053 variance= 0.0143867013499
cluster size 5: 0.234578947368 variance= 0.0198663255719
N=1000, k=100, ordered
cluster size 1: 0.701 variance= 0.00766485002661
cluster size 2: 0.857684210526 variance= 0.0176984596733
cluster size 3: 0.789684210526 variance= 0.0167865329561
cluster size 4: 0.507684210526 variance= 0.0126534400483
cluster size 5: 0.219368421053 variance= 0.0169243247793
N=1000, k=500, unordered
cluster size 1: 0.700684210526 variance= 0.007801293169
cluster size 2: 0.820526315789 variance= 0.01457457638
cluster size 3: 0.818842105263 variance= 0.0183383277895
cluster size 4: 0.746526315789 variance= 0.015868958431
cluster size 5: 0.610368421053 variance= 0.0152337965315
N=1000, k=500, ordered
cluster size 1: 0.700684210526 variance= 0.007801293169
cluster size 2: 0.869789473684 variance= 0.0180102314943
cluster size 3: 0.845368421053 variance= 0.0159053335883
cluster size 4: 0.673473684211 variance= 0.00976670007713
cluster size 5: 0.361315789474 variance= 0.0104499971903
N=2000, k=20, unordered
cluster size 1: 0.700368421053 variance= 0.00769074768977
cluster size 2: 0.640526315789 variance= 0.0221487298555
cluster size 3: 0.204473684211 variance= 0.0378126424637
cluster size 4: 0.0664736842105 variance= 0.138256387266
cluster size 5: 0.0506842105263 variance= nan
N=2000, k=20, ordered
cluster size 1: 0.700368421053 variance= 0.00769074768977
cluster size 2: 0.630105263158 variance= 0.0264241487937
cluster size 3: 0.239526315789 variance= 0.0361436486445
cluster size 4: 0.0793157894737 variance= 0.101085955584
cluster size 5: 0.0505789473684 variance= nan
N=2000, k=200, unordered
cluster size 1: 0.699157894737 variance= 0.0079878291321
cluster size 2: 0.822789473684 variance= 0.0126268086715
cluster size 3: 0.777210526316 variance= 0.0142314972777
cluster size 4: 0.613315789474 variance= 0.0138474231588
cluster size 5: 0.328421052632 variance= 0.0209482256821
N=2000, k=200, ordered
cluster size 1: 0.699157894737 variance= 0.0079878291321
cluster size 2: 0.860052631579 variance= 0.0172439729084
cluster size 3: 0.833473684211 variance= 0.0162309370198
cluster size 4: 0.599421052632 variance= 0.00877694184321
cluster size 5: 0.270263157895 variance= 0.0115750797678
N=2000, k=1000, unordered
cluster size 1: 0.700578947368 variance= 0.00802410817571
cluster size 2: 0.822947368421 variance= 0.0153082198484
cluster size 3: 0.823473684211 variance= 0.0174844770842
cluster size 4: 0.770684210526 variance= 0.0139376625345
cluster size 5: 0.679421052632 variance= 0.0121767138499
N=2000, k=1000, ordered
cluster size 1: 0.700578947368 variance= 0.00802410817571
cluster size 2: 0.868736842105 variance= 0.0189144044194
cluster size 3: 0.869526315789 variance= 0.0152931871831
cluster size 4: 0.751947368421 variance= 0.00977705296661
cluster size 5: 0.490157894737 variance= 0.0113004387725
N=5000, k=50, unordered
cluster size 1: 0.701421052632 variance= 0.00759945399446
cluster size 2: 0.650421052632 variance= 0.0218346888101
cluster size 3: 0.238842105263 variance= 0.0458384507115
cluster size 4: 0.0537894736842 variance= 0.204623078462
cluster size 5: 0.0526315789474 variance= nan
N=5000, k=50, ordered
cluster size 1: 0.701421052632 variance= 0.00759945399446
cluster size 2: 0.637631578947 variance= 0.0250116923924
cluster size 3: 0.326368421053 variance= 0.0285224035678
cluster size 4: 0.0693684210526 variance= 0.0444721910411
cluster size 5: 0.0526315789474 variance= nan
N=5000, k=500, unordered
cluster size 1: 0.701263157895 variance= 0.00751020410254
cluster size 2: 0.83 variance= 0.0138940867047
cluster size 3: 0.819684210526 variance= 0.0148676849256
cluster size 4: 0.719368421053 variance= 0.0135177462742
cluster size 5: 0.463842105263 variance= 0.0186928346055
N=5000, k=500, ordered
cluster size 1: 0.701263157895 variance= 0.00751020410254
cluster size 2: 0.866473684211 variance= 0.0174848922984
cluster size 3: 0.873 variance= 0.0154175893028
cluster size 4: 0.742421052632 variance= 0.00991557958183
cluster size 5: 0.413052631579 variance= 0.0104009337859
N=5000, k=2500, unordered
cluster size 1: 0.702368421053 variance= 0.00782653458528
cluster size 2: 0.827157894737 variance= 0.014761191566
cluster size 3: 0.836052631579 variance= 0.0155068192939
cluster size 4: 0.801 variance= 0.012378039929
cluster size 5: 0.737473684211 variance= 0.0118714537228
N=5000, k=2500, ordered
cluster size 1: 0.702368421053 variance= 0.00782653458528
cluster size 2: 0.876736842105 variance= 0.0176761054841
cluster size 3: 0.896263157895 variance= 0.0153612895765
cluster size 4: 0.833947368421 variance= 0.00927902376262
cluster size 5: 0.645526315789 variance= 0.0104743555592
N=8000, k=80, unordered
cluster size 1: 0.700473684211 variance= 0.00784107366714
cluster size 2: 0.646526315789 variance= 0.0229530088544
cluster size 3: 0.282631578947 variance= 0.0306974935575
cluster size 4: 0.0542631578947 variance= 0.24132728251
cluster size 5: 0.0526315789474 variance= nan
N=8000, k=80, ordered
cluster size 1: 0.700473684211 variance= 0.00784107366714
cluster size 2: 0.632684210526 variance= 0.0278157797713
cluster size 3: 0.339947368421 variance= 0.0263757966785
cluster size 4: 0.0944210526316 variance= 0.116541686779
cluster size 5: 0.0526315789474 variance= nan
N=8000, k=800, unordered
cluster size 1: 0.701684210526 variance= 0.00774865807587
cluster size 2: 0.828052631579 variance= 0.0141352032295
cluster size 3: 0.834 variance= 0.0155137067907
cluster size 4: 0.754684210526 variance= 0.0135994164215
cluster size 5: 0.546157894737 variance= 0.0111419854459
N=8000, k=800, ordered
cluster size 1: 0.701684210526 variance= 0.00774865807587
cluster size 2: 0.868210526316 variance= 0.0179412351913
cluster size 3: 0.880157894737 variance= 0.0170210331252
cluster size 4: 0.789052631579 variance= 0.00896783591121
cluster size 5: 0.507 variance= 0.0117588658806
N=8000, k=4000, unordered
cluster size 1: 0.704473684211 variance= 0.00752767547909
cluster size 2: 0.825210526316 variance= 0.0140759945326
cluster size 3: 0.836526315789 variance= 0.0152760806661
cluster size 4: 0.810052631579 variance= 0.012912518608
cluster size 5: 0.759157894737 variance= 0.0118521036338
N=8000, k=4000, ordered
cluster size 1: 0.704473684211 variance= 0.00752767547909
cluster size 2: 0.877684210526 variance= 0.0175414483724
cluster size 3: 0.895105263158 variance= 0.0159226150173
cluster size 4: 0.845578947368 variance= 0.00916986261308
cluster size 5: 0.705315789474 variance= 0.0107957208009
N=10000, k=100, unordered
cluster size 1: 0.703052631579 variance= 0.00761986982949
cluster size 2: 0.673894736842 variance= 0.0207642962776
cluster size 3: 0.348210526316 variance= 0.0248763728366
cluster size 4: 0.0833684210526 variance= 0.0814054827272
cluster size 5: 0.0526315789474 variance= nan
N=10000, k=100, ordered
cluster size 1: 0.703052631579 variance= 0.00761986982949
cluster size 2: 0.635315789474 variance= 0.0245497814966
cluster size 3: 0.458631578947 variance= 0.0205408296061
cluster size 4: 0.0810526315789 variance= 0.0519106576613
cluster size 5: 0.0491578947368 variance= 0.280299153698
N=10000, k=1000, unordered
cluster size 1: 0.701578947368 variance= 0.0078765816441
cluster size 2: 0.831105263158 variance= 0.0149288814048
cluster size 3: 0.830631578947 variance= 0.0148351820801
cluster size 4: 0.771894736842 variance= 0.0129950847438
cluster size 5: 0.545157894737 variance= 0.0144464930234
N=10000, k=1000, ordered
cluster size 1: 0.701578947368 variance= 0.0078765816441
cluster size 2: 0.866736842105 variance= 0.0178675578133
cluster size 3: 0.886052631579 variance= 0.0159150675938
cluster size 4: 0.804263157895 variance= 0.00959469118454
cluster size 5: 0.534631578947 variance= 0.00923133612792
N=10000, k=5000, unordered
cluster size 1: 0.703684210526 variance= 0.00745178688178
cluster size 2: 0.823315789474 variance= 0.0139778108356
cluster size 3: 0.839157894737 variance= 0.014401641792
cluster size 4: 0.811315789474 variance= 0.0127325796899
cluster size 5: 0.767789473684 variance= 0.0119053340489
N=10000, k=5000, ordered
cluster size 1: 0.703684210526 variance= 0.00745178688178
cluster size 2: 0.877684210526 variance= 0.0175656436564
cluster size 3: 0.894842105263 variance= 0.0155581920337
cluster size 4: 0.858368421053 variance= 0.00925882559579
cluster size 5: 0.720210526316 variance= 0.0104920908931
Using webcrawl data to build Language Vectors, testing on Europarl:
N = 1000 , k = 10 , unordered
cluster size 1 : 0.73819047619 , var 0.00962799486584
cluster size 2 : 0.636095238095 , var 0.0348428965616
cluster size 3 : 0.129857142857 , var 0.139453226643
cluster size 4 : 0.0519523809524 , var 0.143137743755
cluster size 5 : 0.047619047619 , var nan
N = 1000 , k = 10 , ordered
cluster size 1 : 0.73819047619 , var 0.00962799486584
cluster size 2 : 0.629714285714 , var 0.0353841184994
cluster size 3 : 0.160952380952 , var 0.0511260343861
cluster size 4 : 0.0491904761905 , var 0.11911933431
cluster size 5 : 0.047619047619 , var nan
N = 1000 , k = 100 , unordered
cluster size 1 : 0.736904761905 , var 0.00912860862283
cluster size 2 : 0.872 , var 0.0179210304156
cluster size 3 : 0.825904761905 , var 0.0177680519671
cluster size 4 : 0.60580952381 , var 0.0223189477473
cluster size 5 : 0.313142857143 , var 0.0211755134303
N = 1000 , k = 100 , ordered
cluster size 1 : 0.736904761905 , var 0.00912860862283
cluster size 2 : 0.910142857143 , var 0.0200588784569
cluster size 3 : 0.888952380952 , var 0.0226772441576
cluster size 4 : 0.657666666667 , var 0.0170486816639
cluster size 5 : 0.35980952381 , var 0.0107443050718
N = 1000 , k = 500 , unordered
cluster size 1 : 0.741428571429 , var 0.00835254641621
cluster size 2 : 0.874523809524 , var 0.0145903296079
cluster size 3 : 0.888476190476 , var 0.0197376683926
cluster size 4 : 0.863523809524 , var 0.0184654057303
cluster size 5 : 0.78 , var 0.0170549771411
N = 1000 , k = 500 , ordered
cluster size 1 : 0.741428571429 , var 0.00835254641621
cluster size 2 : 0.919904761905 , var 0.0187099577749
cluster size 3 : 0.935571428571 , var 0.0184385083962
cluster size 4 : 0.853476190476 , var 0.0117865105001
cluster size 5 : 0.649619047619 , var 0.00742475535205
N = 5000 , k = 50 , unordered
cluster size 1 : 0.742047619048 , var 0.00829834562487
cluster size 2 : 0.720666666667 , var 0.0272634830921
cluster size 3 : 0.273619047619 , var 0.0591448382764
cluster size 4 : 0.0622380952381 , var 0.0635595145262
cluster size 5 : 0.047619047619 , var nan
N = 5000 , k = 50 , ordered
cluster size 1 : 0.742047619048 , var 0.00829834562487
cluster size 2 : 0.70819047619 , var 0.0299993784038
cluster size 3 : 0.337904761905 , var 0.0383646322302
cluster size 4 : 0.0717142857143 , var 0.0582515526831
cluster size 5 : 0.0480476190476 , var 0.141559805675
N = 5000 , k = 500 , unordered
cluster size 1 : 0.742047619048 , var 0.00797173438826
cluster size 2 : 0.88880952381 , var 0.0135409353914
cluster size 3 : 0.899095238095 , var 0.0182358691737
cluster size 4 : 0.847666666667 , var 0.017155298953
cluster size 5 : 0.600619047619 , var 0.0154086716815
N = 5000 , k = 500 , ordered
cluster size 1 : 0.742047619048 , var 0.00797173438826
cluster size 2 : 0.923142857143 , var 0.0163680418077
cluster size 3 : 0.950571428571 , var 0.0190100123499
cluster size 4 : 0.901047619048 , var 0.0141174760944
cluster size 5 : 0.694 , var 0.00816348812232
N = 5000 , k = 2500 , unordered
cluster size 1 : 0.741714285714 , var 0.00822005726318
cluster size 2 : 0.879 , var 0.014129016261
cluster size 3 : 0.910476190476 , var 0.017138429037
cluster size 4 : 0.914571428571 , var 0.0173562393691
cluster size 5 : 0.895952380952 , var 0.0162622446477
N = 5000 , k = 2500 , ordered
cluster size 1 : 0.741714285714 , var 0.00822005726318
cluster size 2 : 0.927952380952 , var 0.0166177759456
cluster size 3 : 0.95780952381 , var 0.0178851548014
cluster size 4 : 0.952047619048 , var 0.012522850902
cluster size 5 : 0.899761904762 , var 0.006443712743
N = 10000 , k = 100 , unordered
cluster size 1 : 0.743761904762 , var 0.00780756309644
cluster size 2 : 0.747857142857 , var 0.0262911289808
cluster size 3 : 0.335476190476 , var 0.0301072432386
cluster size 4 : 0.0609523809524 , var 0.371057254866
cluster size 5 : 0.047619047619 , var nan
N = 10000 , k = 100 , ordered
cluster size 1 : 0.743761904762 , var 0.00780756309644
cluster size 2 : 0.72580952381 , var 0.0291688797654
cluster size 3 : 0.484285714286 , var 0.0279704632492
cluster size 4 : 0.0987142857143 , var 0.0861144457735
cluster size 5 : 0.0484285714286 , var 0.118974568606
N = 10000 , k = 1000 , unordered
cluster size 1 : 0.743714285714 , var 0.00788341407791
cluster size 2 : 0.888904761905 , var 0.0145307571942
cluster size 3 : 0.907952380952 , var 0.0174482759093
cluster size 4 : 0.884952380952 , var 0.0175076856969
cluster size 5 : 0.743952380952 , var 0.0166098654851
N = 10000 , k = 1000 , ordered
cluster size 1 : 0.743714285714 , var 0.00788341407791
cluster size 2 : 0.925523809524 , var 0.017472773866
cluster size 3 : 0.957380952381 , var 0.0174792543031
cluster size 4 : 0.943571428571 , var 0.0123953938384
cluster size 5 : 0.83080952381 , var 0.0070630886038
N = 10000 , k = 5000 , unordered
cluster size 1 : 0.742285714286 , var 0.00787423833099
cluster size 2 : 0.878714285714 , var 0.0142378684153
cluster size 3 : 0.910952380952 , var 0.0170228994737
cluster size 4 : 0.919571428571 , var 0.0172522207604
cluster size 5 : 0.911523809524 , var 0.0161772682367
N = 10000 , k = 5000 , ordered
cluster size 1 : 0.742285714286 , var 0.00787423833099
cluster size 2 : 0.927380952381 , var 0.0169603819556
cluster size 3 : 0.961619047619 , var 0.0181195677123
cluster size 4 : 0.962857142857 , var 0.0121729344364
cluster size 5 : 0.935095238095 , var 0.00671645187211
--------------------------------------------------------------
Retaining spaces:
--------------------------------------------------------------
N = 10000 , k = 5000 , unordered
cluster size 1 : 0.749238095238 , var 0.00578222543492
cluster size 2 : 0.905333333333 , var 0.0127813236416
cluster size 3 : 0.923571428571 , var 0.0149129259615
cluster size 4 : 0.940857142857 , var 0.0168462518353
cluster size 5 : 0.951761904762 , var 0.0149957920637
N = 10000 , k = 5000 , ordered
cluster size 1 : 0.749238095238 , var 0.00578222543492
cluster size 2 : 0.939571428571 , var 0.0193137110487
cluster size 3 : 0.972476190476 , var 0.0207294756583
cluster size 4 : 0.978285714286 , var 0.0145603076755
cluster size 5 : 0.973285714286 , var 0.00836696389008
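
The "N, k, ordered/unordered" blocks above follow a regular enough layout to be read mechanically. The following is a minimal Python sketch for loading them into rows, assuming the two line layouts seen in this file ("cluster size 2: 0.58 variance= 0.024" and "cluster size 2 : 0.63 , var 0.035"). The names parse_results and best_cluster_size are hypothetical helpers, not part of the original experiment code. The sketch ignores the section titles and the first "n=10000, k=5000:" block, which uses a different layout.

```python
import re

# Block headers such as "N=1000, k=10, unordered" or "N = 1000 , k = 10 , ordered".
HEADER = re.compile(
    r"^[Nn]\s*=\s*(\d+)\s*,\s*k\s*=\s*(\d+)\s*,\s*(ordered|unordered)\s*$"
)
# Result lines in either layout; the label is matched loosely ("clust...")
# so small spelling variations do not drop a row. Variance may be "nan".
RESULT = re.compile(
    r"^clust\w*\s+size\s+(\d+)\s*:\s*([\d.]+)\s*,?\s*var(?:iance=)?\s*([\d.]+|nan)\s*$"
)


def parse_results(text):
    """Return one dict per result line, tagged with the enclosing N, k and order."""
    rows = []
    current = None  # (N, k, order) of the most recent block header
    for raw in text.splitlines():
        line = raw.strip()
        header = HEADER.match(line)
        if header:
            current = (int(header.group(1)), int(header.group(2)), header.group(3))
            continue
        result = RESULT.match(line)
        if result and current is not None:
            n, k, order = current
            rows.append({
                "N": n,
                "k": k,
                "order": order,
                "cluster_size": int(result.group(1)),
                "accuracy": float(result.group(2)),
                "variance": float(result.group(3)),  # float("nan") for "nan"
            })
    return rows


def best_cluster_size(rows, n, k, order):
    """Cluster size with the highest accuracy for one (N, k, order) setting."""
    matching = [r for r in rows if (r["N"], r["k"], r["order"]) == (n, k, order)]
    if not matching:
        return None
    return max(matching, key=lambda r: r["accuracy"])["cluster_size"]
```

Because section titles are ignored, blocks with the same N, k and order from different sections (Gutenberg, webcrawl, retaining spaces) would be merged. Parsing each section's text separately avoids that.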