Finnish Wikipedia 2017, source
No Thumbnail Available
Restricted Availability
Date
Persistent identifier of the Data Catalogue metadata
Creator/contributor
Editor
Journal title
Journal volume
Publisher
Publication Type
Peer Review Status
Repositories
Access rights
Open
ISBN
ISSN
Description
The Finnish Wikipedia 2017 source material corpus is available for download.
The corpus contains all Finnish articles from the online encyclopedia Wikipedia available in 1 January 2018. The text parts of the articles have been extracted from [Wikipedia Dumps](https://dumps.wikimedia.org/) with [WikiExtractor](https://github.com/attardi/wikiextractor).
The corpus has been tokenized and annotated with morpho-syntactic analysis produced with the [Turku Dependency Parser](http://turkunlp.github.io/Finnish-dep-parser/)
License: CC BY https://creativecommons.org/licenses/by/4.0/