The demo includes two operational modes: one focused on attested lexical items, ensuring outputs correspond to existing Italian words, and another dedicated to neologism exploration, allowing users to investigate novel lexical candidates generated by the model. Additional parameters such as semantic tags, usage contexts, and prompt constraints provide fine-grained control over the generation process, supporting both exploratory and research-oriented use cases.
NeoIT5-large builds upon the IT5-large model and has been fine-tuned on a corpus of Italian lexical tasks derived from dictionaries and Wikipedia. The model is openly available on Hugging Face (snizio/NeoIT5-large), reflecting a commitment to reproducibility and open research practices.
With NEOIT5, we aim to offer both an interactive research demonstrator and a practical tool for studying lexical reasoning, creativity, and generalization in Neural Language Models, contributing to ongoing investigations into the linguistic competence of AI systems.
Click here to try the demo.
References
Ciaccio C., Miaschi A., Dell’Orletta F. (2025) “Evaluating Lexical Proficiency in Neural Language Models”. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), July 27–August 1st, 2025, Vienna
(Please cite the papers above if you use NEOIT5 in your research)