default search action
12 WAC@LREC 2020: Marseille, France
- Adrien Barbaresi, Felix Bildhauer, Roland Schäfer, Egon Stemle:
Proceedings of the 12th Web as Corpus Workshop, WAC@LREC 2020, Marseille, France, May 2020. European Language Resources Association 2020, ISBN 979-10-95546-68-9 - Milos Jakubícek, Vojtech Kovár, Pavel Rychlý, Vit Suchomel:
Current Challenges in Web Corpus Building. 1-4 - Adrien Barbaresi, Gaël Lejeune:
Out-of-the-Box and into the Ditch? Multilingual Evaluation of Generic Text Extraction Tools. 5-13 - Veronika Laippala, Samuel Rönnqvist, Saara Hellström, Juhani Luotolahti, Liina Repo, Anna Salmela, Valtteri Skantsi, Sampo Pyysalo:
From Web Crawl to Clean Register-Annotated Corpora. 14-22 - Heidi Jauhiainen, Tommi Jauhiainen, Krister Lindén:
Building Web Corpora for Minority Languages. 23-32 - Balázs Indig, Árpád Knap, Zsófia Sárközi-Lindner, Mária Timári, Gábor Palkó:
The ELTE.DH Pilot Corpus - Creating a Handcrafted Gigaword Web Corpus with Metadata. 33-41 - Shaurya Rawat, Mariano Rico, Óscar Corcho:
Hypernym-LIBre: A Free Web-based Corpus for Hypernym Detection. 42-49 - Shabnam Behzad, Amir Zeldes:
A Cross-Genre Ensemble Approach to Robust Reddit Part of Speech Tagging. 50-56 - Tim Kreutz, Walter Daelemans:
Streaming Language-Specific Twitter Data with Optimal Keywords. 57-64
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.