56f44d61fcb0afc8ff24a68e8250c6af847801e9
444452 self-made

 

challenge
"He Said She Said" classification challenge (2nd edition)
submitter
s444452
submitted
2022-04-26 21:05:16.309511 UTC
original repo
https://git.wmi.amu.edu.pl/s444452/petite-difference-challenge2.git / branch master
publicly available at
git://gonito.net/petite-difference-challenge2 / branch submission-07332
browsable at
https://gonito.net/gitlist/petite-difference-challenge2.git/submission-07332/
clone by
git clone --single-branch git://gonito.net/petite-difference-challenge2 -b submission-07332
file basename
out

test-A / bf881d501530f84ce601c14cf92ec277fa6e7db7
Metric Score
Likelihood 0.00000
Accuracy 0.63615
Likelihood Accuracy
+H 0.00000 0.64750
+C 0.00000 0.66192
-C 0.00000 0.63534

dev-1 / 2486010210171c14be0cbd2e75341ed66cf0b508
Metric Score
Likelihood 0.00000
Accuracy 0.63821
Likelihood Accuracy
+H 0.00000 0.00000
+C 0.00000 0.00000
-C 0.00000 0.63822

worst items

note: the gold standard is taken from the submission itself, not from the challenge data!
# input expected output actual output test-A Accuracy +C
1 zakończyłem jakiś czas temu. Potem dość długie lata śpiewałem w chórze for-humans contaminated 0 1 0.00000

dev-0 / 2a976384138d0c2949f4c4288f475a7a6a5f698a
Metric Score
Likelihood 0.00000
Accuracy 0.64646
Likelihood Accuracy
+H 0.00000 0.67500
+C 1.00000 1.00000
-C 0.00000 0.64646

worst items

note: the gold standard is taken from the submission itself, not from the challenge data!
# input expected output actual output test-A Accuracy +C
1 Cierpiałem na straszne lagi – kilkanaście sekund lub dłużej czarnego ekranu przy próbie przełączenia się / uruchomienia prawie każdej aplika… 1 1 1.00000

Compare with other submission