Diachronic normalisation of Polish texts

Transform old Polish texts into modern spelling. [ver. 1.0.0]

Log in to write a comment

ked   2023-10-12 10:01
submitted a solution:plt5-base_normalizer_test_pruned, no finetuning
p/tlen   2022-07-07 06:18
submitted a solution:Lucene Transducers ver. 0.25-SNAPSHOT
p/tlen   2022-07-06 18:45
submitted a solution:Lucene Transducers ver. 0.24
p/tlen   2022-07-06 18:35
submitted a solution:Lucene Transducers ver. 0.24
p/tlen   2022-07-06 18:23
submitted a solution:Lucene Transducers ver. 0.24-SNAPSHOT
p/tlen   2022-07-06 18:21
submitted a solution:Lucene Transducers ver. 0.24-SNAPSHOT
p/tlen   2022-07-06 14:41
submitted a solution:Lucene Transducers ver. 0.24-SNAPSHOT
p/tlen   2022-02-24 19:47
submitted a solution:Lucene Transducers ver. 0.23-SNAPSHOT
p/tlen   2021-10-20 14:00
submitted a solution:Lucene Transducers ver. 0.23-SNAPSHOT
p/tlen   2021-10-20 11:02
submitted a solution:Lucene Transducers ver. 0.22-SNAPSHOT
[anonymized]   2021-08-15 13:19
submitted a solution:0.22 use nosecondary option
[anonymized]   2021-08-03 20:03
submitted a solution:Lucene transducers 0.22 - move pairs to a separate file
p/tlen   2020-04-22 19:16
submitted a solution:PSI-Toolkit Diachroniser 2020
p/tlen   2019-10-26 19:38
submitted a solution:Lucene Transducers 0.21
p/tlen   2019-10-19 20:13
submitted a solution:Lucene Transducers 20
p/tlen   2018-03-30 12:49
submitted a solution:PSI-Toolkit better-diachronizer
p/tlen   2018-03-17 11:07
submitted a solution:use Lucene token filter with sub-word variants (v. 0.15)
[anonymized]   2018-03-16 20:38
submitted a solution:Raw normalization
p/tlen   2018-03-16 13:25
submitted a solution:use Lucene filter with words mined using word2vec (v. 0.14)
p/tlen   2018-03-16 11:16
submitted a solution:use Lucene filter without OCR fixes (v. 0.13)
p/tlen   2018-03-15 20:55
submitted a solution:use Lucene token filter (v. 0.12)
p/tlen   2018-03-15 20:47
submitted a solution:do nothing