View Item 
        •   Utrecht University Student Theses Repository Home
        • UU Theses Repository
        • Theses
        • View Item
        •   Utrecht University Student Theses Repository Home
        • UU Theses Repository
        • Theses
        • View Item
        JavaScript is disabled for your browser. Some features of this site may not work without it.

        Browse

        All of UU Student Theses RepositoryBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

        Character-level Neural Architectures for Jointly Predicting Word Alignments and Word-internal Structure in Morphologically Complex Languages

        Thumbnail
        View/Open
        main.pdf (511.8Kb)
        Publication date
        2017
        Author
        Bijl de Vroe, S.G.C.
        Metadata
        Show full item record
        Summary
        Through a word alignment task between English and Turkish, this project investigates ways to more effectively approach morphologically complex languages in the field of \ac{NLP}. Our models create an inductive bias to focus on word-internal structure, by taking character-level input and jointly predicting alignment, lemmas and morphological tags. Current versions of the model are able to exploit the lemma distribution so that the predicted alignment distribution improves in quality, while possible improvements to the morphological tag side of the architecture are identified. Furthermore, different methods of encoding character-level input are explored, suggesting that modern neural architectures might benefit from using multiple types of encoders in conjunction. Finally, the benefit of moving away from word-level input data towards the character level is further supported.
        URI
        https://studenttheses.uu.nl/handle/20.500.12932/27813
        Collections
        • Theses
        Utrecht university logo