Diachronic normalisation of Polish texts

Transform old Polish texts into modern spelling. [ver. 1.0.0]

Log in to write a comment

ked 2023-10-12 10:01

submitted a solution:plt5-base_normalizer_test_pruned, no finetuning

p/tlen 2022-07-07 06:18

submitted a solution:Lucene Transducers ver. 0.25-SNAPSHOT

p/tlen 2022-07-06 18:45

submitted a solution:Lucene Transducers ver. 0.24

p/tlen 2022-07-06 18:35

submitted a solution:Lucene Transducers ver. 0.24

p/tlen 2022-07-06 18:23

submitted a solution:Lucene Transducers ver. 0.24-SNAPSHOT

p/tlen 2022-07-06 18:21

submitted a solution:Lucene Transducers ver. 0.24-SNAPSHOT

p/tlen 2022-07-06 14:41

submitted a solution:Lucene Transducers ver. 0.24-SNAPSHOT

p/tlen 2022-02-24 19:47

submitted a solution:Lucene Transducers ver. 0.23-SNAPSHOT

p/tlen 2021-10-20 14:00

submitted a solution:Lucene Transducers ver. 0.23-SNAPSHOT

p/tlen 2021-10-20 11:02

submitted a solution:Lucene Transducers ver. 0.22-SNAPSHOT

[anonymized] 2021-08-15 13:19

submitted a solution:0.22 use nosecondary option

[anonymized] 2021-08-03 20:03

submitted a solution:Lucene transducers 0.22 - move pairs to a separate file

p/tlen 2020-04-22 19:16

submitted a solution:PSI-Toolkit Diachroniser 2020

p/tlen 2019-10-26 19:38

submitted a solution:Lucene Transducers 0.21

p/tlen 2019-10-19 20:13

submitted a solution:Lucene Transducers 20

p/tlen 2018-03-30 12:49

submitted a solution:PSI-Toolkit better-diachronizer

p/tlen 2018-03-17 11:07

submitted a solution:use Lucene token filter with sub-word variants (v. 0.15)

[anonymized] 2018-03-16 20:38

submitted a solution:Raw normalization

p/tlen 2018-03-16 13:25

submitted a solution:use Lucene filter with words mined using word2vec (v. 0.14)

p/tlen 2018-03-16 11:16

submitted a solution:use Lucene filter without OCR fixes (v. 0.13)

p/tlen 2018-03-15 20:55

submitted a solution:use Lucene token filter (v. 0.12)

p/tlen 2018-03-15 20:47

submitted a solution:do nothing