"He Said She Said" classification challenge

Guess whether a text in Polish was written by a man or woman.

Git repo URL: git://gonito.net/petite-difference-challenge / Branch: master

(Browse at http://gonito.net/gitlist/petite-difference-challenge.git/master)

Leaderboard

# submitter when description test-A/Accuracy ×
1 p/tlen 2016-05-15 18:31 VW tokens + 3-gram LM (+fix for the latest grep) 0.710777835465144 17
2 nozdi 2016-06-17 09:52 3 best voting 0.704700995893708 21
3 Marta 2016-05-23 04:15 nozdi Naive Bayes + Tfidf + swear words + emoticons 0.6583164204465 4
4 [anonymised] 2016-06-23 09:01 3gram model KenLM + stemming 0.656475669042337 4
5 Marek 2016-05-24 11:55 klon rozwiazania Mateusza + RandomForestClassifier 0.652923962807382 1
6 Przemysław Nowaczyk 2015-12-10 09:05 naive bayes by Przemysław Nowaczyk kod zrodlowy i zasoby 0.646864822768679 2
7 asdf 2016-05-24 17:13 lemma + nozdi naive bayes 0.639885307027894 1
8 Veal 2016-02-15 18:27 Fixed source code, added makefile. 0.634050361070468 5
9 [anonymised] 2015-12-17 07:34 pliki zrodlowe w odp. folderach 0.628840798602917 2
10 Jacek 2016-05-16 16:08 100k samle with RandomForestClassifier 0.619176853731061 7
11 Katarzyna 2016-05-23 13:20 more iterations + randomized start 0.613518903100958 5
12 Maxi 2016-05-14 14:55 Added naive_bayes.py 0.600698541558503 2
13 [anonymised] 2015-12-16 21:17 Dodane kody zrodlowe 0.571913201491481 4
14 R.J. 2016-02-12 07:55 men only baseline 0.5 2