"He Said She Said" classification challenge

Guess whether a text in Polish was written by a man or woman. [ver. 1.0.0]

Git repo URL: git://gonito.net/petite-difference-challenge / Branch: master

(Browse at https://gonito.net/gitlist/petite-difference-challenge.git/master)

Leaderboard

# submitter when ver. description test-A Accuracy ×
1 p/tlen 2016-05-15 18:31 1.0.0 VW tokens + 3-gram LM (+fix for the latest grep) 0.7107778354651437 18
2 nozdi 2016-06-17 09:52 1.0.0 3 best voting 0.7047009958937084 21
3 Marta 2016-05-23 04:15 1.0.0 nozdi Naive Bayes + Tfidf + swear words + emoticons 0.6583164204465002 4
4 [anonymised] 2016-06-23 09:01 1.0.0 3gram model KenLM + stemming 0.6564756690423372 6
5 Marek 2016-05-24 11:55 1.0.0 klon rozwiazania Mateusza + RandomForestClassifier 0.6529239628073819 1
6 [anonymised] 2015-12-10 09:05 1.0.0 naive bayes by Przemysław Nowaczyk kod zrodlowy i zasoby 0.646864822768679 2
7 asdf 2016-05-24 17:13 1.0.0 lemma + nozdi naive bayes 0.6398853070278945 1
8 Veal 2016-02-15 18:27 1.0.0 Fixed source code, added makefile. 0.6340503610704677 6
9 [anonymised] 2015-12-17 07:34 1.0.0 pliki zrodlowe w odp. folderach 0.6288407986029169 3
10 Jacek 2016-05-16 16:08 1.0.0 100k samle with RandomForestClassifier 0.6191768537310615 7
11 Katarzyna 2016-05-23 13:20 1.0.0 more iterations + randomized start 0.6135189031009581 5
12 Maxi 2016-05-14 14:55 1.0.0 Added naive_bayes.py 0.6006985415585029 2
13 [anonymised] 2015-12-16 21:17 1.0.0 Dodane kody zrodlowe 0.5719132014914806 4
14 R.J. 2016-02-12 07:55 1.0.0 men only baseline 0.5 2