> The researchers were able to perform text classification using only text compression and a clustering algorithm (kNN).
Wasnt this result disproved, because the positive results were due to a bad kNN implementation? I recall reading something like this but can't recover the exact post/article...
Thank you for this, I appreciate it! That's unfortunate to hear. I may have to swap out the example I used in this article, and maybe also include a note that this technique has limitations. I think that using compression/Kolmogorov complexity metrics for classification is a fruitful endeavor and that the philosophy of groups like the Hutter Prize are sound, but the kNN + gzip example looks like it has some problems with it.
Wasnt this result disproved, because the positive results were due to a bad kNN implementation? I recall reading something like this but can't recover the exact post/article...