ISACCO (Italian School-Age Children COrpus)

ISACCO (Italian School-Age Children COrpus) is a corpus of oral and written productions of TD (typically developing) Italian L1-speaking children attending the second, third and fourth grade of the primary school.
The current version of the corpus comprises 58 oral texts (for a total of 7,429 tokens) and 125 written texts (for a total of 13,909 tokens).
All texts were collected by asking children to perform a narrative retelling task, which was proposed as part of their school activities and administered in the class.
This resource is conceived to support research and computational modeling of “later language acquisition”, with a special emphasis for comparative assessment of language skills along the diamesic dimension and across early school grades.

Download

Click here to download the corpus. (Note: after filling in the request form, the download link will appear at the bottom of the page.)

References

Brunato D., Dell’Orletta F. (2015) ISACCO: a corpus for investigating spoken and written language development in Italian school-age children, In Proceedings of the second Italian Conference of Computational Linguistics (CLiC-it), 3-4 December 2015, Trento.

(Please cite the paper above if you make use of this corpus in your research)