Automatsko označavanje i analiza teksta za LARSP hrvatskog jezika [Automatic tagging and analysis of text for the Croatian-language LARSP]

Authors :
Tot, Bruno
Lazić, Nikolaj
Mildner, Vesna
Publication Year :
2019

Abstract

Speech is one of the most important means of human communication. Speech and language develop together from birth, first through listening and then through imitation. Like all human characteristics, they are subject to disorders and impairments. Some of these can be remediated, which is why it is important to detect them as early as possible. LARSP was developed for that purpose: it is a linguistic protocol that unifies assessment, remediation and screening (that is, monitoring) in order to enable accurate diagnosis and successful remediation. It is based on grammatical analysis of transcribed children's speech. Manual analysis of a larger text corpus is time-consuming and susceptible to human error. LARSPTool is a tool developed to facilitate that process. Automatic preprocessing and data gathering significantly speed up the analysis and make it possible to analyse a much larger corpus. LARSPTool also introduces a standard and uniformity to both data processing and presentation, which opens up further possibilities for tracking trends in children's speech and language development across generations. The current version has integrated support for Croatian, and later versions will be modular. It is currently available only for Windows operating systems, but versions for other platforms could be developed.

Details

Language :
Croatian
Database :
OpenAIRE
Accession number :
edsair.dedup.wf.001..f7647b425ac53f2263fc2f0c82313e07