Text Simplification

PaCCSS-IT (Parallel Corpus of Complex-Simple Sentences for ITalian). PaCCSS-IT is a corpus of Complex-Simple Aligned Sentences for ITalian of about 63,000 pairs of sentences automatically built.

TERENCE and TEACHER.
Terence and Teacher are both corpora of original and manually simplified texts aligned at sentence level. Specifically, Terence comprises 32 short Italian novels for children and their manually simplified version for a total of 1036 original and 1060 simplified sentences. Teacher is a corpus of 18 pairs of Italian documents (e.g. literature, handbooks) downloaded from different educational websites and their manually simplified version for a total of 266 original and 255 simplified sentences.