List challenges

CAICCAIC: Centre for Artificial Intelligence Challenge on Conversational AI Correctness

Develop Natural Language Understanding models that are robust to speech recognition errors. [Exact Match Accuracy]

Deadline: 2023-06-22 12:01:00 UTC

Temporal Image Caption Retrieval Competition

Retrieve a caption for a picture from a historical newspaper. [MAP]

Deadline: 2099-06-15 12:01:00 UTC

Pierwsza kolumna zbioru in.tsv zawiera początek dialogu pewnej lektury. Dialogi mogą być być prowadzone przez dowolną ilość osób i nie zawierają innych adnotacji niż sama wypowiedź (np. komentarzy narratora). Poszczególne wypowiedzi w początku dialogu oddzielone są separatorem [SEP]. Każda kolejna kolumna to propozycja kontynuacji dialogu. Kontynuacja dialogu może pochodzić z tej samej lub innej lektury. Istnieje tylko jedna taka poprawna kontynuacja dialogu- ta, która faktycznie występuje w książce. Zadaniem jest zwrócić poprawną kontynuację dialogu. [MAP]

Deadline: 2022-05-23 16:00:01.819979 UTC

Meteo rain

W latach 1980-2020 prowadzono pomiary opadów deszczu. Jednostką jest miesięczna suma opadów w milimetrach. Lista stacji pogodowych znajduje się w pliku dataset_splits.tsv. Stacje pogodowe podzielone są na 3 zbiory: train, dev-0, test-A. [RMSE]

Deadline: 2022-05-23 16:00:01.819979 UTC

Challenging America geo prediction

Guess publication location for a piece of text [Haversine]

Challenging America word-gap prediction

Guess a word in a gap. [PerplexityHashed]

Challenging America year prediction

Guess the time when an excerpt was published [RMSE]
challenging-america diachronic

wmt-2020-pl-en

Translate from Polish to English. [BLEU]

Searching for Legal Clauses by Analogy. Few-shot Contract Discovery Shared Task

The aim of this task is to provide a substrings of requested document representing clauses analogous (semantically and formally equivalent) to provided examples from other documents. [Soft-F1.0]

RetroC2 temporal classification challenge

Guess the publication year of a Polish text. [RMSE]

"He Said She Said" classification challenge (2nd edition)

Give the probability that a text in Polish was written by a man. [Likelihood]

GEval sample challenge

Find tomatos in images. [Soft2D-F2.0]

[???]

Stwórz rozwiązanie wykrywające twarze i tablice rejestracyjne na zdjęciu. Do treningu możesz użyć przykładowo następujących zbiorów danych: https://www.kaggle.com/datasets/andrewmvd/car-plate-detection oraz https://www.kaggle.com/datasets/dataturks/face-detection-in-images. [Soft2D-F2.0]

PlanktonDetectorChallenge

This dataset contains data from WHOI-Plankton dataset: https://github.com/hsosik/WHOI-Plankton Data was downloaded form http://dx.doi.org/10.1575/1912/7341, sampled and filtered, leaving images of higher quality and biggest potential for our University Project. [Accuracy]

CoachStats

Guess whether the weather is good for a walk (given temperature, wind and rain). [Accuracy]

GEval sample challenge

Guess the mass of a planet. [RMSE]

German Passage Retrieval (MIRACL)

German dev dataset from the MIRACL challenge. [NDCG@10]

FCE - Grammatical error detection

Detect errors in English text. [Mean F0.5]

Passage Retrieval

Passage Retrieval is a crucial part of modern open-domain question-answering systems that rely on precise and efficient retrieval components to find passages containing correct answers. [NDCG@10]

Medical search reranker 2

Rerank okapi bm25 search results for medical search engine. [NDCG@50]

Medical referrals

The challenge is an instance of the multi-label text classification problem. [F1]
classification medicine pol

Archive video ocr challenge

Get video frames and read text [F1]
ocr video pol

Book layout

Detect structure of given page. [F1]

Medical search reranker

Rerank okapi bm25 search results for medical search engine. [NDCG@50]

Diachronia year prediction

Guess the time when a Polish excerpt was published [RMSE]

Fake or not?

Czy dany artukuł to prawda czy fake news? [Likelihood]

Fake they say

Czy autor tweeta uważa podlinkowaną stronę za fake news? [LikelihoodHashed]

Object detection challenge

Classify objects and determine their positions on scanned first pages of newspaper images. [F1]
pol diachronic computer-vision

Diachronic full normalization challenge

Do OCR (or OCR post-correction) and modernize Polish text. [CharMatch]
pol modernization diachronic

Diachronic OCR challenge

Do OCR of a Polish historical text (or post-correction of Tesseract OCR) [CharMatch]
pol diachronic

Diachronic normalization challenge

Modernize Polish text. [CharMatch]
pol modernization diachronic

Arxiv tables challange

Key information extraction for scientific tables. Guess the <mask> token in texts based on tables images and context from text. [Accuracy]
eng document-understanding

HWR challenge for index cards (only recognition, no detection)

Handwriting Recognition for Polish index cards. [WER]

GLUE-LM-GAP

GLUE-LM-GAP is LM-GAP challenge base on GLUE benchmark. [PerplexityHashed]

Extract key information from Edgar NDA documents

Extract the information from NDAs (Non-Disclosure Agreements) about the involved parties, jurisdiction, contract term, etc. [F1(UC)]

Detecting handwriting for index cards

Handwriting detection for Polish index cards [F1]

Wiki Historian En

Guess the masked date in an wikipedia article. [RMSE-Against-Interval]

PLEWI - polish errors correction challenge

Correct Polish grammatical errors. [CharMatch]

The task of image retrieval in historical publications

Detect iconography in digitized historical publications. [F1]

Eur-lex-documents

Eur-lex-documents multilabel long documents classification. Assign one, more than one or none labels to each doc. [F1]

twitter 140 year prediction

Dataset from paper "Twitter Sentiment Classification using Distant Supervision" [RMSE]

twitter 140 temporal word gap filling

Dataset from paper "Twitter Sentiment Classification using Distant Supervision" [PerplexityHashed]

twitter 140 temporal sentiment classification

Dataset from paper "Twitter Sentiment Classification using Distant Supervision" [Accuracy]

Ireland news headlines — word gap prediction

Predict the masked word given text and year. [PerplexityHashed]

Bookspines

Matching OCR from bookspines with books. [Accuracy]

Ireland news headlines — year prediction

Predict the year Start Date: 1996-01-01 End Date: 2019-12-31 [RMSE]

Ireland news headlines

Predict the headline category given headine text and year Start Date: 1996-01-01 End Date: 2019-12-31 [Accuracy]

Classify Polish urban legend texts

Classify Polish urban legend texts the way folklorists do. [Accuracy]

Criminal snippets classification challenge

Guess whether a search engine snippet contains possibly criminal content. [F1.0]

HWR challenge for index cards

Handwriting Recognition for Polish index cards [WER]

CoNLL-2003 English Named Entity Recognition.

NER challenge for CoNLL-2003 English. Annotations were taken from University of Antwerp. The English data is a collection of news wire articles from the Reuters Corpus, RCV1. [BIO-F1]

OCR challenge for index cards

The goal of this task is to post-process the output from the Tesseract OCR engine. Alternatively, it could be treated as an OCR, as images are also available. [CharMatch]

Guess the date of liverpool fc subreddit

Guess a reddit date based on its text. All reddits come form liverpoolfc reddit. [RMSE]

milion news headlines

Predict the date of headline. Start Date: 2003-02-19 ; End Date: 2019-12-31 [RMSE]

RetroC-En temporal classification challenge

Guess the publication year of a English text from the Chronicling America collection (1836-1922). [RMSE]

Guess the date of reddits (large edition)

Guess a reddit date based on its text. This is larger version with more reddits and subrredits (topics) than in https://gonito.net/challenge/guess-reddit-date. [MSE]

Guess the date of reddits

Guess a reddit date based on its text. [MSE]

Skeptic vs paranormal subreddits

Classify a reddit as either from Skeptic subreddit or one of the "paranormal" subreddits (Paranormal, UFOs, TheTruthIsHere, Ghosts, ,Glitch-in-the-Matrix, conspiracytheories). [Likelihood]

WikiReading Dataset

Extract information from Wikipedia articles (WikiReading dataset repackaged). [Mean/MultiLabel-F1.0]

Grammatical error detection

Detect errors in english text. [Mean F0.5]

Deadline: 2021-02-26 10:00:00 UTC

Mieszkania5 challenge

Guess the price of a flat/house. [MAE]

Cluster weird stories from Polish newspapers

Cluster weird stories by their types. [NMI]

ASR errors correction

Open Challenge for Correcting Errors of Speech Recognition Systems [WER]

Mieszkania4 Challenge

Guess the prices of flats in Poznan. Edition 2018 [RMSE]

Sport Texts Classification Challenge - Ball

Guess whether the sport is connected to the ball for a Polish article. Evaluation metrics: Accuracy, Likelihood. [Likelihood]

Twitter Sentiment Analysis

Guess the sentiment for texts in English. [Likelihood]

Wikipedia English-to-Polish Transliteration

Translate Wikipedia entries from English to Polish character by character. [WER]

Sport Texts Classification Challenge

Guess the sport discipline for a Polish article. [LikelihoodHashed]

WMT2017 Czech-English machine translation challenge for news

Translate news articles from Czech into English. [BLEU]

PolEval 2018 NER task

Determine nested Named Entities in NKJP-compatible way, that is provide a series of labels with corresponding token indexes. [MultiLabel-F1.0]

Sentiment by emoticons challenge

Give the probability of a positive sentiment for a short Polish text. [LogLoss]

Mieszkania3 Challenge

Guess the prices of flats in Poznan. Edition 2018 [RMSE]

Diachronic normalisation of Polish texts

Transform old Polish texts into modern spelling. [CharMatch]

Titanic challenge

Guess who survived from the disaster. [Accuracy]

Diachronic equivalents

For a given Polish word, as used in a given year, give a diachronic equivalent (a.k.a. temporal word analogy) for a given year. [MAP]

Mushroom classification challenge

Predict whether the mushroom is edible (e) or poisonous (p). [Accuracy]

Cars challenge

Predict the price of a car. [RMSE]

Gratka flats challenge 2017

Predict the price of flats in Poznań. [RMSE]

WMT2017 German-English machine translation challenge for news

Translate news articles from German into English. [BLEU]

Guess a word in a gap in historic texts

Give a probability distribution for a word in a gap in a corpus of Polish historic texts spanning 1814-2013. This is a challenge for (temporal) language models. [LogLossHashed]

Cluster Polish urban legend texts

Cluster Polish urban legend texts the way folklorists do. [NMI]

Clipping death notices

Clip a death notice in a Polish newspaper. [F1]

Sane words challenge

Guess if a given word is a correct Polish word in a given domain. Additionally, you have the information on reported frequency of the word in source texts. [F2.0]

Gratka flats challenge

Predict the price of flats in Poznań. Each entry in training data set is described by: Price, Rooms, SqrMeters, Floor, Location, Description. Evaluation metric is RMSE. [RMSE]

Russian-Polish Opensubtitles

Translate subtitles from Russian into Polish. [BLEU]

Clipping Obituaries

Clip an obituary in a Polish newspaper. (This is only a sample challenge!) [ClippEU]

RetroC temporal classification challenge for Vietnamese

Guess the publication year of a Vietnamese text. The metric is root mean square error. [RMSE]

English-Polish Europarl

Translate Europarl proceedings from English into Polish. [BLEU]

"He Said She Said" classification challenge

Guess whether a text in Polish was written by a man or woman. [Accuracy]

RetroC temporal classification challenge

Guess the publication year of a Polish text. [RMSE]