Nikola Ljubesic
2020 – today
- 2024
- [j18]Tran Thi Hong Hanh, Matej Martinc, Andraz Repar, Nikola Ljubesic, Antoine Doucet, Senja Pollak:
Can cross-domain term extraction benefit from cross-lingual transfer and nested term labeling? Mach. Learn. 113(7): 4285-4314 (2024) - [j17]Valentin Hofmann, Goran Glavas, Nikola Ljubesic, Janet B. Pierrehumbert, Hinrich Schütze:
Geographic Adaptation of Pretrained Language Models. Trans. Assoc. Comput. Linguistics 12: 411-431 (2024) - [c85]Johannes Kiesel, Çagri Çöltekin, Maximilian Heinrich, Maik Fröbe, Milad Alshomary, Bertrand De Longueville, Tomaz Erjavec, Nicolas Handke, Matyás Kopp, Nikola Ljubesic, Katja Meden, Nailia Mirzakhmedova, Vaidas Morkevicius, Theresa Reitis-Münstermann, Mario Scharfbillig, Nicolas Stefanovitch, Henning Wachsmuth, Martin Potthast, Benno Stein:
Overview of Touché 2024: Argumentation Systems. CLEF (2) 2024: 308-332 - [c84]Johannes Kiesel, Çagri Çöltekin, Maximilian Heinrich, Maik Fröbe, Milad Alshomary, Bertrand De Longueville, Tomaz Erjavec, Nicolas Handke, Matyás Kopp, Nikola Ljubesic, Katja Meden, Nailia Mirzakhmedova, Vaidas Morkevicius, Theresa Reitis-Münstermann, Mario Scharfbillig, Nicolas Stefanovitch, Henning Wachsmuth, Martin Potthast, Benno Stein:
Overview of Touché 2024: Argumentation Systems. CLEF (Working Notes) 2024: 3341-3366 - [c83]Filip Dobranic, Bojan Evkoski, Nikola Ljubesic:
A Lightweight Approach to a Giga-Corpus of Historical Periodicals: The Story of a Slovenian Historical Newspaper Collection. LREC/COLING 2024: 695-703 - [c82]Nikola Ljubesic, Taja Kuzman:
CLASSLA-web: Comparable Web Corpora of South Slavic Languages Enriched with Linguistic and Genre Annotation. LREC/COLING 2024: 3271-3282 - [c81]Rik van Noord, Taja Kuzman, Peter Rupnik, Nikola Ljubesic, Miquel Esplà-Gomis, Gema Ramírez-Sánchez, Antonio Toral:
Do Language Models Care about Text Quality? Evaluating Web-Crawled Corpora across 11 Languages. LREC/COLING 2024: 5221-5234 - [c80]Darinka Verdonik, Kaja Dobrovoljc, Tomaz Erjavec, Nikola Ljubesic:
Gos 2: A New Reference Corpus of Spoken Slovenian. LREC/COLING 2024: 7825-7830 - [c79]Michal Mochtak, Peter Rupnik, Nikola Ljubesic:
The ParlaSent Multilingual Training Dataset for Sentiment Identification in Parliamentary Proceedings. LREC/COLING 2024: 16024-16036 - [c78]Johannes Kiesel, Çagri Çöltekin, Maximilian Heinrich, Maik Fröbe, Milad Alshomary, Bertrand De Longueville, Tomaz Erjavec, Nicolas Handke, Matyás Kopp, Nikola Ljubesic, Katja Meden, Nailia Mirzakhmedova, Vaidas Morkevicius, Theresa Reitis-Münstermann, Mario Scharfbillig, Nicolas Stefanovitch, Henning Wachsmuth, Martin Potthast, Benno Stein:
Overview of Touché 2024: Argumentation Systems. ECIR (5) 2024: 466-473 - [c77]Stephen Mayhew, Terra Blevins, Shuheng Liu, Marek Suppa, Hila Gonen, Joseph Marvin Imperial, Börje Karlsson, Peiqin Lin, Nikola Ljubesic, Lester James V. Miranda, Barbara Plank, Arij Riabi, Yuval Pinter:
Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark. NAACL-HLT 2024: 4322-4337 - [i21]Rik van Noord, Taja Kuzman, Peter Rupnik, Nikola Ljubesic, Miquel Esplà-Gomis, Gema Ramírez-Sánchez, Antonio Toral:
Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages. CoRR abs/2403.08693 (2024) - [i20]Nikola Ljubesic, Taja Kuzman:
CLASSLA-web: Comparable Web Corpora of South Slavic Languages Enriched with Linguistic and Genre Annotation. CoRR abs/2403.12721 (2024) - [i19]Nikola Ljubesic, Vít Suchomel, Peter Rupnik, Taja Kuzman, Rik van Noord:
Language Models on a Diet: Cost-Efficient Development of Encoders for Closely-Related Languages via Additional Pretraining. CoRR abs/2404.05428 (2024) - [i18]Çagri Çöltekin, Matyás Kopp, Katja Meden, Vaidas Morkevicius, Nikola Ljubesic, Tomaz Erjavec:
Multilingual Power and Ideology Identification in the Parliament: a Reference Dataset and Simple Baselines. CoRR abs/2405.07363 (2024) - [i17]Nikola Ljubesic, Peter Rupnik, Danijel Korzinek:
The ParlaSpeech Collection of Automatically Generated Speech and Text Datasets from Parliamentary Proceedings. CoRR abs/2409.15397 (2024)
- 2023
- [j16]Bojan Evkoski, Petra Kralj Novak, Nikola Ljubesic:
Content-based comparison of communities in social networks: Ex-Yugoslavian reactions to the Russian invasion of Ukraine. Appl. Netw. Sci. 8(1): 40 (2023) - [j15]Bojan Evkoski, Petra Kralj Novak, Nikola Ljubesic:
Correction: Content-based comparison of communities in social networks: Ex-Yugoslavian reactions to the Russian invasion of Ukraine. Appl. Netw. Sci. 8(1): 47 (2023) - [j14]Lisa Hilte, Ilia Markov, Nikola Ljubesic, Darja Fiser, Walter Daelemans:
Who are the haters? A corpus-based demographic analysis of authors of hate speech. Frontiers Artif. Intell. 6 (2023) - [j13]Tomaz Erjavec, Maciej Ogrodniczuk, Petya Osenova, Nikola Ljubesic, Kiril Simov, Andrej Pancur, Michal Rudolf, Matyás Kopp, Starkaður Barkarson, Steinþór Steingrímsson, Çagri Çöltekin, Jesse de Does, Katrien Depuydt, Tommaso Agnoloni, Giulia Venturi, María Calzada Pérez, Luciana D. de Macedo, Costanza Navarretta, Giancarlo Luxardo, Matthew Coole, Paul Rayson, Vaidas Morkevicius, Tomas Krilavicius, Roberts Dargis, Orsolya Ring, Ruben van Heusden, Maarten Marx, Darja Fiser:
The ParlaMint corpora of parliamentary proceedings. Lang. Resour. Evaluation 57(1): 415-448 (2023) - [j12]Taja Kuzman, Igor Mozetic, Nikola Ljubesic:
Automatic Genre Identification for Robust Enrichment of Massive Text Collections: Investigation of Classification Methods in the Era of Large Language Models. Mach. Learn. Knowl. Extr. 5(3): 1149-1175 (2023) - [j11]Nikola Ljubesic, Igor Mozetic, Petra Kralj Novak:
Quantifying the impact of context on the quality of manual hate speech annotation. Nat. Lang. Eng. 29(6): 1481-1494 (2023) - [c76]Marta Bañón, Malina Chichirau, Miquel Esplà-Gomis, Mikel L. Forcada, Aarón Galiano Jiménez, Taja Kuzman, Nikola Ljubesic, Rik van Noord, Leopoldo Pla Sempere, Gema Ramírez-Sánchez, Peter Rupnik, Vit Suchomel, Antonio Toral, Jaume Zaragoza-Bernabeu:
MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages. EAMT 2023: 505-506 - [c75]Agata Savary, Chérifa Ben Khelil, Carlos Ramisch, Voula Giouli, Verginica Barbu Mititelu, Najet Hadj Mohamed, Cvetana Krstev, Chaya Liebeskind, Hongzhi Xu, Sara Stymne, Tunga Güngör, Thomas Pickard, Bruno Guillaume, Eduard Bejcek, Archna Bhatia, Marie Candito, Polona Gantar, Uxoa Iñurrieta Urmeneta, Albert Gatt, Jolanta Kovalevskaite, Timm Lichte, Nikola Ljubesic, Johanna Monti, Carla Parra Escartín, Mehrnoush Shamsfard, Ivelina Stoyanova, Veronika Vincze, Abigail Walsh:
PARSEME corpus release 1.3. MWE@EACL 2023: 24-35 - [c74]Taja Kuzman, Peter Rupnik, Nikola Ljubesic:
Get to Know Your Parallel Data: Performing English Variety and Genre Classification over MaCoCu Corpora. VarDial@EACL 2023: 91-103 - [c73]Peter Rupnik, Taja Kuzman, Nikola Ljubesic:
BENCHić-lang: A Benchmark for Discriminating between Bosnian, Croatian, Montenegrin and Serbian. VarDial@EACL 2023: 113-120 - [c72]Noëmi Aepli, Çagri Çöltekin, Rob van der Goot, Tommi Jauhiainen, Mourhaf Kazzaz, Nikola Ljubesic, Kai North, Barbara Plank, Yves Scherrer, Marcos Zampieri:
Findings of the VarDial Evaluation Campaign 2023. VarDial@EACL 2023: 251-261 - [e8]Yves Scherrer, Tommi Jauhiainen, Nikola Ljubesic, Preslav Nakov, Jörg Tiedemann, Marcos Zampieri:
Tenth Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial@EACL 2023, Dubrovnik, Croatia, May 5, 2023. Association for Computational Linguistics 2023, ISBN 978-1-959429-50-0 [contents] - [i16]Taja Kuzman, Igor Mozetic, Nikola Ljubesic:
ChatGPT: Beginning of an End of Manual Linguistic Data Annotation? Use Case of Automatic Genre Identification. CoRR abs/2303.03953 (2023) - [i15]Noëmi Aepli, Çagri Çöltekin, Rob van der Goot, Tommi Jauhiainen, Mourhaf Kazzaz, Nikola Ljubesic, Kai North, Barbara Plank, Yves Scherrer, Marcos Zampieri:
Findings of the VarDial Evaluation Campaign 2023. CoRR abs/2305.20080 (2023) - [i14]Luka Tercon, Nikola Ljubesic:
CLASSLA-Stanza: The Next Step for Linguistic Processing of South Slavic Languages. CoRR abs/2308.04255 (2023) - [i13]Michal Mochtak, Peter Rupnik, Nikola Ljubesic:
The ParlaSent multilingual training dataset for sentiment identification in parliamentary proceedings. CoRR abs/2309.09783 (2023) - [i12]Stephen Mayhew, Terra Blevins, Shuheng Liu, Marek Suppa, Hila Gonen, Joseph Marvin Imperial, Börje F. Karlsson, Peiqin Lin, Nikola Ljubesic, LJ Miranda, Barbara Plank, Arij Riabi, Yuval Pinter:
Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark. CoRR abs/2311.09122 (2023)
- 2022
- [c71]Marta Bañón, Miquel Esplà-Gomis, Mikel L. Forcada, Cristian García-Romero, Taja Kuzman, Nikola Ljubesic, Rik van Noord, Leopoldo Pla Sempere, Gema Ramírez-Sánchez, Peter Rupnik, Vít Suchomel, Antonio Toral, Tobias van der Werff, Jaume Zaragoza:
MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages. EAMT 2022: 301-302 - [c70]Taja Kuzman, Peter Rupnik, Nikola Ljubesic:
The GINCO Training Dataset for Web Genre Identification of Documents Out in the Wild. LREC 2022: 1584-1594 - [i11]Taja Kuzman, Peter Rupnik, Nikola Ljubesic:
The GINCO Training Dataset for Web Genre Identification of Documents Out in the Wild. CoRR abs/2201.03857 (2022) - [i10]Valentin Hofmann, Goran Glavas, Nikola Ljubesic, Janet B. Pierrehumbert, Hinrich Schütze:
Geographic Adaptation of Pretrained Language Models. CoRR abs/2203.08565 (2022) - [i9]Michal Mochtak, Peter Rupnik, Nikola Ljubesic:
The ParlaSent-BCS dataset of sentiment-annotated parliamentary debates from Bosnia-Herzegovina, Croatia, and Serbia. CoRR abs/2206.00929 (2022)
- 2021
- [j10]Bojan Evkoski, Nikola Ljubesic, Andraz Pelicon, Igor Mozetic, Petra Kralj Novak:
Evolution of topics and hate speech in retweet network communities. Appl. Netw. Sci. 6(1): 96 (2021) - [j9]Tomaz Erjavec, Darja Fiser, Nikola Ljubesic:
The KAS corpus of Slovenian academic writing. Lang. Resour. Evaluation 55(2): 551-583 (2021) - [c69]Yves Scherrer, Nikola Ljubesic:
Sesame Street to Mount Sinai: BERT-constrained character-level Moses models for multilingual lexical normalization. W-NUT 2021: 465-472 - [c68]Filip Markoski, Elena Markoska, Nikola Ljubesic, Eftim Zdravevski, Ljupco Kocarev:
Cultural Topic Modelling over Novel Wikipedia Corpora for South-Slavic Languages. RANLP 2021: 910-917 - [c67]Bharathi Raja Chakravarthi, Mihaela Gaman, Radu Tudor Ionescu, Heidi Jauhiainen, Tommi Jauhiainen, Krister Lindén, Nikola Ljubesic, Niko Partanen, Ruba Priyadharshini, Christoph Purschke, Eswari Rajagopal, Yves Scherrer, Marcos Zampieri:
Findings of the VarDial Evaluation Campaign 2021. VarDial@EACL 2021: 1-11 - [c66]Yves Scherrer, Nikola Ljubesic:
Social Media Variety Geolocation with geoBERT. VarDial@EACL 2021: 135-140 - [c65]Ilia Markov, Nikola Ljubesic, Darja Fiser, Walter Daelemans:
Exploring Stylometric and Emotion-Based Features for Multilingual Cross-Domain Hate Speech Detection. WASSA@EACL 2021: 149-159 - [e7]Marcos Zampieri, Preslav Nakov, Nikola Ljubesic, Jörg Tiedemann, Yves Scherrer, Tommi Jauhiainen:
Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial@EACL 2021, Kiyv, Ukraine, April 20, 2021. Association for Computational Linguistics 2021, ISBN 978-1-954085-12-1 [contents] - [i8]Nikola Ljubesic, Davor Lauc:
BERTić - The Transformer Language Model for Bosnian, Croatian, Montenegrin and Serbian. CoRR abs/2104.09243 (2021) - [i7]Bojan Evkoski, Igor Mozetic, Nikola Ljubesic, Petra Kralj Novak:
Community evolution in retweet networks. CoRR abs/2105.06214 (2021) - [i6]Bojan Evkoski, Andraz Pelicon, Igor Mozetic, Nikola Ljubesic, Petra Kralj Novak:
Retweet communities reveal the main sources of hate speech. CoRR abs/2105.14898 (2021)
- 2020
- [j8]Darja Fiser, Nikola Ljubesic, Tomaz Erjavec:
The Janes project: language resources and tools for Slovene user generated content. Lang. Resour. Evaluation 54(1): 223-246 (2020) - [c64]Simon Krek, Spela Arhar Holdt, Tomaz Erjavec, Jaka Cibej, Andraz Repar, Polona Gantar, Nikola Ljubesic, Iztok Kosem, Kaja Dobrovoljc:
Gigafida 2.0: The Reference Corpus of Written Standard Slovene. LREC 2020: 3340-3345 - [c63]Carlos Santos Armendariz, Matthew Purver, Matej Ulcar, Senja Pollak, Nikola Ljubesic, Mark Granroth-Wilding:
CoSimLex: A Resource for Evaluating Graded Word Similarity in Context. LREC 2020: 5878-5886 - [c62]Carlos Santos Armendariz, Matthew Purver, Senja Pollak, Nikola Ljubesic, Matej Ulcar, Ivan Vulic, Mohammad Taher Pilehvar:
SemEval-2020 Task 3: Graded Word Similarity in Context. SemEval@COLING 2020: 36-49 - [c61]Mihaela Gaman, Dirk Hovy, Radu Tudor Ionescu, Heidi Jauhiainen, Tommi Jauhiainen, Krister Lindén, Nikola Ljubesic, Niko Partanen, Christoph Purschke, Yves Scherrer, Marcos Zampieri:
A Report on the VarDial Evaluation Campaign 2020. VarDial@COLING 2020: 1-14 - [c60]Yves Scherrer, Nikola Ljubesic:
HeLju@VarDial 2020: Social Media Variety Geolocation with BERT Models. VarDial@COLING 2020: 202-211 - [c59]Loïc Barrault, Magdalena Biesialska, Ondrej Bojar, Marta R. Costa-jussà, Christian Federmann, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Matthias Huck, Eric Joanis, Tom Kocmi, Philipp Koehn, Chi-kiu Lo, Nikola Ljubesic, Christof Monz, Makoto Morishita, Masaaki Nagata, Toshiaki Nakazawa, Santanu Pal, Matt Post, Marcos Zampieri:
Findings of the 2020 Conference on Machine Translation (WMT20). WMT@EMNLP 2020: 1-55 - [e6]Marcos Zampieri, Preslav Nakov, Nikola Ljubesic, Jörg Tiedemann, Yves Scherrer:
Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial@COLING 2020, Barcelona, Spain (Online), December 13, 2020. International Committee on Computational Linguistics (ICCL) 2020, ISBN 978-1-952148-47-7 [contents]
2010 – 2019
- 2019
- [j7]Katja Zupan, Nikola Ljubesic, Tomaz Erjavec:
How to tag non-standard language: Normalisation versus domain adaptation for Slovene historical and user-generated texts. Nat. Lang. Eng. 25(5): 651-674 (2019) - [c58]Nikola Ljubesic, Kaja Dobrovoljc:
What does Neural Bring? Analysing Improvements in Morphosyntactic Annotation and Lemmatisation of Slovenian, Croatian and Serbian. BSNLP@ACL 2019: 29-34 - [c57]Nikola Ljubesic, Darja Fiser, Tomaz Erjavec:
The FRENK Datasets of Socially Unacceptable Discourse in Slovene and English. TSD 2019: 103-114 - [c56]Nikola Ljubesic, Darja Fiser, Tomaz Erjavec:
KAS-term: Extracting Slovene Terms from Doctoral Theses via Supervised Machine Learning. TSD 2019: 115-126 - [p2]Marcis Pinnis, Nikola Ljubesic, Dan Stefanescu, Inguna Skadina, Marko Tadic, Tatjana Gornostaja, Spela Vintar, Darja Fiser:
Extracting Data from Comparable Corpora. Using Comparable Corpora for Under-Resourced Areas of Machine Translation 2019: 89-139 - [p1]Ahmet Aker, Radu Ion, Nikos Mastropavlos, Monica Lestari Paramita, Marcis Pinnis, Dan Stefanescu, Fangzhong Su, Gregor Thurmair, Elena Irimia, Nikola Ljubesic, Evangelos Kanoulas, Judita Preiss, Robert J. Gaizauskas, Paul D. Clough, Emma Barker, Nikos Glaros, Tiberiu Boros, Inguna Skadina, Andrejs Vasiljevs:
Appendices. Using Comparable Corpora for Under-Resourced Areas of Machine Translation 2019: 291-323 - [e5]Inguna Skadina, Robert J. Gaizauskas, Bogdan Babych, Nikola Ljubesic, Dan Tufis, Andrejs Vasiljevs:
Using Comparable Corpora for Under-Resourced Areas of Machine Translation. Theory and Applications of Natural Language Processing, Springer 2019, ISBN 978-3-319-99003-3 [contents] - [i5]Nikola Ljubesic, Darja Fiser, Tomaz Erjavec:
The FRENK Datasets of Socially Unacceptable Discourse in Slovene and English. CoRR abs/1906.02045 (2019) - [i4]Nikola Ljubesic, Darja Fiser, Tomaz Erjavec:
KAS-term: Extracting Slovene Terms from Doctoral Theses via Supervised Machine Learning. CoRR abs/1906.02053 (2019) - [i3]Carlos Santos Armendariz, Matthew Purver, Matej Ulcar, Senja Pollak, Nikola Ljubesic, Marko Robnik-Sikonja, Mark Granroth-Wilding, Kristiina Vaik:
CoSimLex: A Resource for Evaluating Graded Word Similarity in Context. CoRR abs/1912.05320 (2019)
- 2018
- [c55]Rob van der Goot, Nikola Ljubesic, Ian Matroos, Malvina Nissim, Barbara Plank:
Bleaching Text: Abstract Features for Cross-lingual Gender Prediction. ACL (2) 2018: 383-389 - [c54]Nikola Ljubesic, Tomaz Erjavec, Darja Fiser:
Datasets of Slovene and Croatian Moderated News Comments. ALW 2018: 124-131 - [c53]Nikola Ljubesic, Darja Fiser, Anita Peti-Stantic:
Predicting Concreteness and Imageability of Words Within and Across Languages via Word Embeddings. Rep4NLP@ACL 2018: 217-222 - [c52]Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Ahmed Ali, Suwon Shon, James R. Glass, Yves Scherrer, Tanja Samardzic, Nikola Ljubesic, Jörg Tiedemann, Chris van der Lee, Stefan Grondelaers, Nelleke Oostdijk, Dirk Speelman, Antal van den Bosch, Ritesh Kumar, Bornini Lahiri, Mayank Jain:
Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign. VarDial@COLING 2018: 1-17 - [c51]Nikola Ljubesic:
Comparing CRF and LSTM performance on the task of morphosyntactic tagging of non-standard varieties of South Slavic languages. VarDial@COLING 2018: 156-163 - [e4]Marcos Zampieri, Preslav Nakov, Nikola Ljubesic, Jörg Tiedemann, Shervin Malmasi, Ahmed Ali:
Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial@COLING 2018, Santa Fe, New Mexico, USA, August 20, 2018. Association for Computational Linguistics 2018, ISBN 978-1-948087-55-1 [contents] - [i2]Rob van der Goot, Nikola Ljubesic, Ian Matroos, Malvina Nissim, Barbara Plank:
Bleaching Text: Abstract Features for Cross-lingual Gender Prediction. CoRR abs/1805.03122 (2018) - [i1]Nikola Ljubesic, Darja Fiser, Anita Peti-Stantic:
Predicting Concreteness and Imageability of Words Within and Across Languages via Word Embeddings. CoRR abs/1807.02903 (2018)
- 2017
- [j6]Antonio Toral, Miquel Esplà-Gomis, Filip Klubicka, Nikola Ljubesic, Vassilis Papavassiliou, Prokopis Prokopidis, Raphael Rubino, Andy Way:
Crawl and crowd to bring machine translation to under-resourced languages. Lang. Resour. Evaluation 51(4): 1019-1051 (2017) - [c50]Darja Fiser, Tomaz Erjavec, Nikola Ljubesic:
Legal Framework, Dataset and Annotation Schema for Socially Unacceptable Online Discourse Practices in Slovene. ALW@ACL 2017: 46-51 - [c49]Tanja Samardzic, Mirjana Starovic, Zeljko Agic, Nikola Ljubesic:
Universal Dependencies for Serbian in Comparison with Croatian and Other Slavic Languages. BSNLP@EACL 2017: 39-44 - [c48]Nikola Ljubesic, Tomaz Erjavec, Darja Fiser:
Adapting a State-of-the-Art Tagger for South Slavic Languages to Non-Standard Text. BSNLP@EACL 2017: 60-68 - [c47]Nikola Ljubesic, Darja Fiser, Tomaz Erjavec:
Language-independent Gender Prediction on Twitter. NLP+CSS@ACL 2017: 1-6 - [c46]Marcos Zampieri, Shervin Malmasi, Nikola Ljubesic, Preslav Nakov, Ahmed Ali, Jörg Tiedemann, Yves Scherrer, Noëmi Aepli:
Findings of the VarDial Evaluation Campaign 2017. VarDial 2017: 1-15 - [e3]Preslav Nakov, Marcos Zampieri, Nikola Ljubesic, Jörg Tiedemann, Shervin Malmasi, Ahmed Ali:
Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial 2017, Valencia, Spain, April 3, 2017. Association for Computational Linguistics 2017, ISBN 978-1-945626-43-2 [contents]
- 2016
- [c45]Nikola Ljubesic, Darja Fiser:
Private or Corporate? Predicting User Types on Twitter. NUT@COLING 2016: 4-12 - [c44]Nikola Ljubesic, Darja Fiser:
A Global Analysis of Emoji Usage. WAC@ACL 2016: 82-89 - [c43]Michael Beißwenger, Thierry Chanier, Tomaz Erjavec, Darja Fiser, Axel Herold, Nikola Ljubesic, Harald Lüngen, Céline Poudat, Egon Stemle, Angelika Storrer, Ciara Wigham:
Closing a Gap in the Language Resources Landscape: Groundwork and Best Practices from Projects on Computer-mediated Communication in four European Countries. CLARIN Annual Conference 2016: 1-18 - [c42]Nikola Ljubesic, Tanja Samardzic, Curdin Derungs:
TweetGeo - A Tool for Collecting, Processing and Analysing Geo-encoded Linguistic Data. COLING 2016: 3412-3421 - [c41]Víctor M. Sánchez-Cartagena, Nikola Ljubesic, Filip Klubicka:
Dealing with Data Sparseness in SMT with Factored Models and Morphological Expansion: a Case Study on Croatian. EAMT 2016: 354-360 - [c40]Filip Klubicka, Gema Ramírez-Sánchez, Nikola Ljubesic:
Collaborative Development of a Rule-Based Machine Translator between Croatian and Serbian. EAMT 2016: 361-367 - [c39]Antonio Toral, Sergio Ortiz-Rojas, Mikel L. Forcada, Nikola Ljubesic, Prokopis Prokopidis:
Abu-MaTran: automatic building of machine translation. EAMT (Projects/Products) 2016 - [c38]Nikola Ljubesic, Katja Zupan, Darja Fiser, Tomaz Erjavec:
Normalising Slovene data: historical texts vs. user-generated content. KONVENS 2016 - [c37]Yves Scherrer, Nikola Ljubesic:
Automatic normalisation of the Swiss German ArchiMob corpus using character-level machine translation. KONVENS 2016 - [c36]Nikola Ljubesic, Tomaz Erjavec:
Corpus vs. Lexicon Supervision in Morphosyntactic Tagging: the Case of Slovene. LREC 2016 - [c35]Nikola Ljubesic, Tomaz Erjavec, Darja Fiser:
Corpus-Based Diacritic Restoration for South Slavic Languages. LREC 2016 - [c34]Nikola Ljubesic, Miquel Esplà-Gomis, Antonio Toral, Sergio Ortiz-Rojas, Filip Klubicka:
Producing Monolingual and Parallel Web Corpora at the Same Time - SpiderLing and Bitextor's Love Affair. LREC 2016 - [c33]Nikola Ljubesic, Filip Klubicka, Zeljko Agic, Ivo-Pavao Jazbec:
New Inflectional Lexicons and Training Corpora for Improved Morphosyntactic Annotation of Croatian and Serbian. LREC 2016 - [c32]Vanja Stefanec, Nikola Ljubesic, Jelena Kuvac Kraljevic:
Croatian Error-Annotated Corpus of Non-Professional Written Language. LREC 2016 - [c31]Tomaz Erjavec, Jaka Cibej, Spela Arhar Holdt, Nikola Ljubesic, Darja Fiser:
Gold-Standard Datasets for Annotation of Slovene Computer-Mediated Communication. RASLAN 2016: 29-40 - [c30]Darja Fiser, Nikola Ljubesic:
Detecting Semantic Shifts in Slovene Twitterese. RASLAN 2016: 43-50 - [c29]Shervin Malmasi, Marcos Zampieri, Nikola Ljubesic, Preslav Nakov, Ahmed Ali, Jörg Tiedemann:
Discriminating between Similar Languages and Arabic Dialect Identification: A Report on the Third DSL Shared Task. VarDial@COLING 2016: 1-14 - [c28]Maja Popovic, Kostadin Cholakov, Valia Kordoni, Nikola Ljubesic:
Enlarging Scarce In-domain English-Croatian Corpus for SMT of MOOCs Using Serbian. VarDial@COLING 2016: 97-105 - [e2]Preslav Nakov, Marcos Zampieri, Liling Tan, Nikola Ljubesic, Jörg Tiedemann, Shervin Malmasi:
Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial@COLING 2016, Osaka, Japan, December 12, 2016. The COLING 2016 Organizing Committee 2016, ISBN 978-4-87974-716-7 [contents]
- 2015
- [j5]Tomaz Erjavec, Nikola Ljubesic, Natasa Logar:
The slWaC Corpus of the Slovene Web. Informatica (Slovenia) 39(1) (2015) - [j4]Nikola Ljubesic, Denis Kranjcic:
Discriminating Between Closely Related Languages on Twitter. Informatica (Slovenia) 39(1) (2015) - [j3]Nikola Ljubesic, Kaja Dobrovoljc, Darja Fiser:
*MWELex - MWE Lexica of Croatian, Slovene and Serbian Extracted from Parsed Corpora. Informatica (Slovenia) 39(3) (2015) - [c27]Zeljko Agic, Nikola Ljubesic:
Universal Dependencies for Croatian (that work for Serbian, too). BSNLP@RANLP 2015: 1-8 - [c26]Tanja Samardzic, Nikola Ljubesic, Maja Milicevic:
Regional Linguistic Data Initiative (ReLDI). BSNLP@RANLP 2015: 40-42 - [c25]Antonio Toral, Flammie A. Pirinen, Andy Way, Gema Ramírez-Sánchez, Sergio Ortiz-Rojas, Raphael Rubino, Miquel Esplà-Gomis, Mikel L. Forcada, Vassilis Papavassiliou, Prokopis Prokopidis, Nikola Ljubesic:
Abu-MaTran: Automatic building of Machine Translation. EAMT 2015 - [c24]Nikola Ljubesic, Darja Fiser, Tomaz Erjavec, Jaka Cibej, Dafne Marko, Senja Pollak, Iza Skrjanec:
Predicting the Level of Text Standardness in User-generated Content. RANLP 2015: 371-378 - [c23]Nikola Ljubesic, Miquel Esplà-Gomis, Filip Klubicka, Nives Mikelic Preradovic:
Predicting Inflectional Paradigms and Lemmata of Unknown Words for Semi-automatic Expansion of Morphological Lexicons. RANLP 2015: 379-387 - [c22]Raphaël Rubino, Flammie A. Pirinen, Miquel Esplà-Gomis, Nikola Ljubesic, Sergio Ortiz-Rojas, Vassilis Papavassiliou, Prokopis Prokopidis, Antonio Toral:
Abu-MaTran at WMT 2015 Translation Task: Morphological Segmentation and Web Crawling. WMT@EMNLP 2015: 184-191
- 2014
- [c21]Nikola Ljubesic, Filip Klubicka:
{bs, hr, sr}WaC - Web Corpora of Bosnian, Croatian and Serbian. WaC@EACL 2014: 29-35 - [c20]Nikola Ljubesic, Tomaz Erjavec, Darja Fiser:
Standardizing Tweets with Character-Level Machine Translation. CICLing (2) 2014: 164-175 - [c19]Miquel Esplà-Gomis, Filip Klubicka, Nikola Ljubesic, Sergio Ortiz-Rojas, Vassilis Papavassiliou, Prokopis Prokopidis:
Comparing two acquisition systems for automatically building an English-Croatian parallel corpus from multilingual websites. LREC 2014: 1252-1258 - [c18]Zeljko Agic, Nikola Ljubesic:
The SETimes.HR Linguistically Annotated Corpus of Croatian. LREC 2014: 1724-1727 - [c17]Nikola Ljubesic, Antonio Toral:
caWaC - A web corpus of Catalan and its application to language modeling and machine translation. LREC 2014: 1728-1732 - [c16]Raphaël Rubino, Antonio Toral, Nikola Ljubesic, Gema Ramírez-Sánchez:
Quality Estimation for Synthetic Parallel Data Generation. LREC 2014: 1843-1849 - [c15]Nikola Ljubesic, Darja Fiser, Tomaz Erjavec:
TweetCaT: a tool for building Twitter corpora of smaller languages. LREC 2014: 2279-2283 - [c14]Marcos Zampieri, Liling Tan, Nikola Ljubesic, Jörg Tiedemann:
A Report on the DSL Shared Task 2014. VarDial@COLING 2014: 58-67 - [e1]Marcos Zampieri, Liling Tan, Nikola Ljubesic, Jörg Tiedemann:
Proceedings of the First Workshop on Applying NLP Tools to Similar Languages, Varieties and Dialects, VarDial@COLING 2014, Dublin, Ireland, August 23, 2014. Association for Computational Linguistics and Dublin City University 2014, ISBN 978-1-873769-39-3 [contents]
- 2013
- [j2]Marianna Apidianaki, Nikola Ljubesic, Darja Fiser:
Vector Disambiguation for Translation Extraction from Comparable Corpora. Informatica (Slovenia) 37(2): 193-201 (2013) - [c13]Zeljko Agic, Nikola Ljubesic, Danijela Merkler:
Lemmatization and Morphosyntactic Tagging of Croatian and Serbian. BSNLP@ACL 2013: 48-57 - [c12]Nikola Ljubesic, Darja Fiser:
Identifying false friends between closely related languages. BSNLP@ACL 2013: 69-77 - [c11]Marianna Apidianaki, Nikola Ljubesic, Darja Fiser:
Cross-lingual WSD for Translation Extraction from Comparable Corpora. BUCC@ACL 2013: 1-10
- 2012
- [c10]Jörg Tiedemann, Nikola Ljubesic:
Efficient Discrimination Between Closely Related Languages. COLING 2012: 2619-2634 - [c9]Darja Fiser, Nikola Ljubesic, Ozren Kubelka:
Addressing polysemy in bilingual lexicon extraction from comparable corpora. LREC 2012: 3031-3035
- 2011
- [c8]Darja Fiser, Nikola Ljubesic, Spela Vintar, Senja Pollak:
Building and Using Comparable Corpora for Domain-Specific Bilingual Lexicon Extraction. BUCC@ACL 2011: 19-26 - [c7]Darja Fiser, Nikola Ljubesic:
Bilingual lexicon extraction from comparable corpora for closely related languages. RANLP 2011: 125-131 - [c6]Nikola Ljubesic, Darja Fiser:
Bootstrapping Bilingual Lexicons from Comparable Corpora for Closely Related Languages. TSD 2011: 91-98 - [c5]Nikola Ljubesic, Tomaz Erjavec:
hrWaC and slWac: Compiling Web Corpora for Croatian and Slovene. TSD 2011: 395-402
- 2010
- [j1]Nikola Ljubesic, Petra Bago, Damir Boras:
Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need? J. Comput. Inf. Technol. 18(4) (2010) - [c4]Zeljko Agic, Nikola Ljubesic, Marko Tadic:
Towards Sentiment Analysis of Financial Texts in Croatian. LREC 2010 - [c3]Nikola Ljubesic, Tomislava Lauc, Damir Boras:
Building a Gold Standard for Event Detection in Croatian. LREC 2010
2000 – 2009
- 2008
- [c2]Nikola Ljubesic, Damir Boras, Nikola Bakaric, Jasmina Njavro:
Comparing measures of semantic similarity. ITI 2008: 675-682 - [c1]Nikola Ljubesic, Tomislava Lauc, Damir Boras:
Generating a Morphological Lexicon of Organization Entity Names. LREC 2008
last updated on 2024-11-05 22:03 CET by the dblp team
all metadata released as open data under CC0 1.0 license