Guess the publication location of a piece of text
Guess the time when an excerpt was published
Translate from Polish to English.
The aim of this task is to provide substrings of the requested document that represent clauses analogous (semantically and formally equivalent) to the provided examples from other documents.
Guess the publication year of a Polish text.
Give the probability that a text in Polish was written by a man.
Predict the year. Start Date: 1996-01-01 ; End Date: 2019-12-31
Predict the headline category given the headline text and year. Start Date: 1996-01-01 ; End Date: 2019-12-31
Classify Polish urban legend texts the way folklorists do.
Guess whether a search engine snippet contains possibly criminal content.
Handwriting Recognition for Polish index cards
NER challenge for CoNLL-2003 English. Annotations were taken from the University of Antwerp. The English data is a collection of news wire articles from the Reuters Corpus, RCV1.
The goal of this task is to post-process the output from the Tesseract OCR engine. Alternatively, it could be treated as an OCR task, as images are also available.
Guess a reddit date based on its text. All reddits come from the liverpoolfc subreddit.
Predict the date of a headline. Start Date: 2003-02-19 ; End Date: 2019-12-31
Guess the publication year of an English text from the Chronicling America collection (1836-1922).
Guess a reddit date based on its text. This is a larger version, with more reddits and subreddits (topics), than https://gonito.net/challenge/guess-reddit-date.
Guess a reddit date based on its text.
Classify a reddit as coming either from the Skeptic subreddit or from one of the "paranormal" subreddits (Paranormal, UFOs, TheTruthIsHere, Ghosts, Glitch-in-the-Matrix, conspiracytheories).
Extract information from Wikipedia articles (WikiReading dataset repackaged).
Detect errors in English text.
Guess the price of a flat/house.
Cluster weird stories by their types.
Open Challenge for Correcting Errors of Speech Recognition Systems
Guess the prices of flats in Poznan. Edition 2018
Guess whether the sport discussed in a Polish article is a ball sport. Evaluation metrics: Accuracy, Likelihood.
Guess the sentiment for texts in English.
Translate Wikipedia entries from English to Polish character by character.
Guess the sport discipline for a Polish article.
Translate news articles from Czech into English.
Determine nested Named Entities in an NKJP-compatible way, that is, provide a series of labels with corresponding token indexes.
Give the probability of a positive sentiment for a short Polish text.
Guess the prices of flats in Poznan. Edition 2018
Transform old Polish texts into modern spelling.
Guess who survived the disaster.
For a given Polish word, as used in a given year, give a diachronic equivalent (a.k.a. temporal word analogy) for a given year.
Predict whether the mushroom is edible (e) or poisonous (p).
Predict the price of a car.
Predict the price of flats in Poznań.
Translate news articles from German into English.
Give a probability distribution for a word in a gap in a corpus of Polish historic texts spanning 1814-2013. This is a challenge for (temporal) language models.
Cluster Polish urban legend texts the way folklorists do.
Clip a death notice in a Polish newspaper.
Guess whether a given word is a correct Polish word in a given domain. Additionally, you have information on the reported frequency of the word in the source texts.
Predict the price of flats in Poznań. Each entry in the training data set is described by: Price, Rooms, SqrMeters, Floor, Location, Description. The evaluation metric is RMSE.
Translate subtitles from Russian into Polish.
Clip an obituary in a Polish newspaper. (This is only a sample challenge!)
Guess the publication year of a Vietnamese text. The metric is root mean square error.
Translate Europarl proceedings from English into Polish.
Guess whether a text in Polish was written by a man or a woman.
Guess the publication year of a Polish text.
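
Some of the challenges listed above (the Poznań flat prices task and the Vietnamese publication year task) name root mean square error (RMSE) as their metric. As an illustration only, here is a minimal Python sketch of that metric. It assumes gold values and predictions arrive as equal-length lists of numbers; the function name `rmse` and the sample values are hypothetical and are not taken from any challenge's own tooling.

```python
import math


def rmse(expected, predicted):
    """Root mean square error between two equal-length sequences of numbers."""
    if len(expected) != len(predicted):
        raise ValueError("expected and predicted must have the same length")
    if not expected:
        raise ValueError("at least one value is required")
    # Square each error, average the squares, then take the square root.
    squared_errors = [(e - p) ** 2 for e, p in zip(expected, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical publication years: gold values vs. a system's guesses.
gold = [1900, 1950, 2000]
guesses = [1903, 1946, 2000]
print(rmse(gold, guesses))
```

Because the errors are squared before averaging, one prediction that is far off raises the score more than several that are slightly off, which is worth keeping in mind for the price and year prediction tasks.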