Righto, I've been doing a little tinkering and tuning, and I've run the text with a slightly larger word-count limit, which has excluded some of the less effusive forumistas. My own top-5 comparisons are now:
Simon Kirby and Exhausted - 71.2%
Simon Kirby and On the edge - 67.2%
Simon Kirby and Phil_D11102 - 67.2%
Simon Kirby and lordtup - 66.6%
Simon Kirby and NWNREADER - 65.9%
and I am least like - wait for it...
Simon Kirby and Swift Hook - 32.3% - ain't that the truth!
I did say that I wasn't going to out anyone, but as Richard asked the question I think it's only fair to say that Richard's second closest comparison was...
Richard Garvie and blackdog - 73.9% - so it turns out that blackdog is actually Richard Garvie!
I will just share one more - blackdog's least close match is...
blackdog and Swift Hook - 29.8%
If the Leader's reports really were written by JSH (and I can't imagine that they'd have been written by anyone else) then that really puts the nails in the coffin of Richard's theory.
motormad's top five:
motormad and xjay1337 - 98.6%
motormad and Timbo - 98.0%
motormad and JeffG - 89.2%
motormad and x2lls - 85.0%
motormad and Strafin - 80.5%
Andy Capp's top five:
Andy Capp and Iommi - 90.0%
Andy Capp and Strafin - 70.2%
Andy Capp and JeffG - 69.5%
Andy Capp and Bloggo - 66.8%
Andy Capp and x2lls - 66.5%
Nothing Much's top five:
Nothing Much and xjay1337 - 60.2%
Nothing Much and motormad - 51.9%
Nothing Much and Bill1 - 49.2%
Nothing Much and Timbo - 48.8%
Nothing Much and JeffG - 48.6%
On the edge's top five:
On the edge and Darren - 77.7%
On the edge and Bofem - 77.2%
On the edge and spartacus - 77.1%
On the edge and Jayjay - 75.5%
On the edge and lordtup - 74.3%
To answer blackdog's question about confidence levels, I think the answer is that the technique isn't really valid in that quantitative kind of way. If you have a text that you want to attribute to an author, and you have some samples from a set of candidate authors, then you can use the technique to suggest the most likely author from that set of candidates, but that's not the same thing as putting a confidence level on the match. This was done with the Galbraith attribution, to see if Galbraith was J K Rowling, but even comparing the text with the published text from just four candidate authors, Rowling wasn't the best match for every test.
To give an idea though, I would suggest that numbers above 90% here are likely for texts by the same author, but I see at least one such correlation between two forumistas who I know to be different people, so there really isn't any great confidence.
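For anyone curious what "suggest the most likely author from a set of candidates" looks like in practice, here is a rough Python sketch. To be clear, this is not the method behind the figures above: it assumes a very simple measure (cosine similarity between the relative frequencies of a few common function words), and the word list and the function names are made up purely for illustration.

```python
import math
import re
from collections import Counter

# Assumption: a small, hypothetical list of marker words (common function words).
FUNCTION_WORDS = ["the", "and", "of", "to", "a", "in", "that", "is", "it", "but"]

def profile(text):
    """Relative frequency of each marker word in the text."""
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(words)
    total = len(words) or 1  # avoid dividing by zero on empty text
    return [counts[w] / total for w in FUNCTION_WORDS]

def similarity(text_a, text_b):
    """Cosine similarity of the two profiles, as a percentage (0 to 100)."""
    a, b = profile(text_a), profile(text_b)
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return 100.0 * dot / norm if norm else 0.0

def most_likely_author(unknown, samples):
    """Score every candidate sample; return the best-scoring name and all scores."""
    scores = {name: similarity(unknown, text) for name, text in samples.items()}
    return max(scores, key=scores.get), scores
```

Note that it only ever picks the best of the candidates you hand it. It will always name somebody, even if the real author isn't in the set, which is why the score can't be read as a confidence level.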
Can I interest anyone else in seeing their comparisons?