Kimmo Kettunen 0001
Person information
- affiliation: University of Eastern Finland, School of Humanities, Finnish Language and Cultural Research, Joensuu, Finland
- affiliation: University of Helsinki, DH Research, Finland
- affiliation: National Library of Finland, National Digitisation Centre Mikkeli, Helsinki, Finland
- affiliation (PhD 2007): University of Tampere, Department of Information Studies, Finland
Other persons with the same name
- Kimmo Kettunen - disambiguation page
- Kimmo Kettunen 0002 - Helsinki University of Technology, Signal Processing Laboratory, Laboratory of Telecommunications Technology, Finland
2020 – today
- 2023
- [j12] Kimmo Kettunen, Heikki Keskustalo, Sanna Kumpulainen, Tuula Pääkkönen, Juha Rautiainen:
Optical character recognition quality affects subjective user perception of historical newspaper clippings. J. Documentation 79(7): 137-156 (2023)
- [j11] Heikki Keskustalo, Laura Korkeamäki, Selja Vanamo, Kimmo Kettunen, Sanna Kumpulainen:
Analyzing gender clues in war-time letters. Digit. Scholarsh. Humanit. 38(1): 209-223 (2023)
- 2022
- [c29] Kimmo Kettunen:
Geographic Space in Pentti Haanpää's Novel Korpisotaa - Where Does the War Happen? DHNB 2022: 297-307
- [c28] Kimmo Kettunen, Heikki Keskustalo, Sanna Kumpulainen, Tuula Pääkkönen, Juha Rautiainen:
OCR Quality Affects Perceived Usefulness of Historical Newspaper Clippings - A User Study. IRCDL 2022
- [i4] Kimmo Kettunen, Heikki Keskustalo, Sanna Kumpulainen, Tuula Pääkkönen, Juha Rautiainen:
OCR quality affects perceived usefulness of historical newspaper clippings - a user study. CoRR abs/2203.03557 (2022)
- [i3] Kimmo Kettunen, Heikki Keskustalo, Sanna Kumpulainen, Tuula Pääkkönen, Juha Rautiainen:
Optical character recognition quality affects perceived usefulness of historical newspaper clippings. CoRR abs/2206.00369 (2022)
- 2020
- [c27] Kimmo Kettunen, Matti La Mela:
Digging Deeper into the Finnish Parliamentary Protocols - Using a Lexical Semantic Tagger for Studying Meaning Change of Everyman's Rights (Allemansrätten). DHN 2020: 63-80
- [c26] Teemu Ruokolainen, Kimmo Kettunen:
Name the Name - Named Entity Recognition in OCRed 19th and Early 20th Century Finnish Newspaper and Journal Collection Data. DHN 2020: 137-156
- [c25] Kimmo Kettunen:
Adding Compound Splitting and Analysis to a Semantic Tagger of Modern Standard Finnish - On the Way to FiSTComp. Baltic HLT 2020: 150-157
2010 – 2019
- 2019
- [c24] Kimmo Kettunen, Teemu Ruokolainen, Erno Liukkonen, Pierrick Tranouez, Daniel Antelme, Thierry Paquet:
Detecting Articles in a Digitized Finnish Historical Newspaper Collection 1771-1929: Early Results Using the PIVAJ Software. DATeCH 2019: 59-64
- [c23] Kimmo Kettunen, Mika Koistinen:
Open Source Tesseract in Re-OCR of Finnish Fraktur from 19th and Early 20th Century Newspapers and Journals - Collected Notes on Quality Improvement. DHN 2019: 270-282
- [c22] Matti La Mela, Minna Tamper, Kimmo Kettunen:
Finding Nineteenth-century Berry Spots: Recognizing and Linking Place Names in a Historical Newspaper Berry-picking Corpus. DHN 2019: 295-307
- [c21] Kimmo Kettunen, Tuula Pääkkönen, Erno Liukkonen:
Clipping the Page - Automatic Article Detection and Marking Software in Production of Newspaper Clippings of a Digitized Historical Journalistic Collection. TPDL 2019: 356-360
- [p1] Jussi Karlgren, Turid Hedlund, Kalervo Järvelin, Heikki Keskustalo, Kimmo Kettunen:
The Challenges of Language Variation in Information Access. Information Retrieval Evaluation in a Changing World 2019: 201-216
- 2018
- [c20] Kimmo Kettunen, Jukka Kervinen, Mika Koistinen:
Creating and Using Ground Truth OCR Sample Data for Finnish Historical Newspapers and Journals. DHN 2018: 162-169
- [c19] Tuula Pääkkönen, Jukka Kervinen, Kimmo Kettunen:
Digitisation and Digital Library Presentation System - A Resource-Conscientious Approach. DHN 2018: 297-305
- [c18] Kimmo Kettunen, Mika Koistinen, Teemu Ruokolainen:
Research and Development Efforts on the Digitized Historical Newspaper and Journal Collection of The National Library of Finland. DHN 2018: 474-479
- 2017
- [j10] Kimmo Kettunen, Eetu Mäkelä, Teemu Ruokolainen, Juha Kuokkala, Laura Löfberg:
Old Content and Modern Tools - Searching Named Entities in a Finnish OCRed Historical Newspaper Collection 1771-1910. Digit. Humanit. Q. 11(3) (2017)
- [c17] Kimmo Kettunen, Teemu Ruokolainen:
Names, Right or Wrong: Named Entities in an OCRed Historical Finnish Newspaper Collection. DATeCH 2017: 181-186
- [c16] Mika Koistinen, Kimmo Kettunen, Jukka Kervinen:
How to Improve Optical Character Recognition of Historical Finnish Newspapers Using Open Source Tesseract OCR Engine - Final Notes on Development and Evaluation. LCT 2017: 17-30
- [c15] Kimmo Kettunen, Laura Löfberg:
Tagging Named Entities in 19th Century and Modern Finnish Newspaper Material with a Finnish Semantic Tagger. NODALIDA 2017: 29-36
- [c14] Mika Koistinen, Kimmo Kettunen, Tuula Pääkkönen:
Improving Optical Character Recognition of Finnish Historical Newspapers with a Combination of Fraktur & Antiqua Models and Image Preprocessing. NODALIDA 2017: 277-283
- 2016
- [j9] Tuula Pääkkönen, Jukka Kervinen, Kimmo Kettunen, Asko Nivala, Eetu Mäkelä:
Exporting Finnish Digitized Historical Newspaper Contents for Offline Use. D Lib Mag. 22(7/8) (2016)
- [j8] Anni Järvelin, Heikki Keskustalo, Eero Sormunen, Miamaria Saastamoinen, Kimmo Kettunen:
Information retrieval from historical newspaper collections in highly inflectional languages: A query expansion approach. J. Assoc. Inf. Sci. Technol. 67(12): 2928-2946 (2016)
- [c13] Kimmo Kettunen, Tuula Pääkkönen, Mika Koistinen:
Between Diachrony and Synchrony: Evaluation of Lexical Quality of a Digitized Historical Finnish Newspaper and Journal Collection with Morphological Analyzers. Baltic HLT 2016: 122-129
- [c12] Kimmo Kettunen, Tuula Pääkkönen:
Measuring Lexical Quality of a Historical Finnish Newspaper Collection - Analysis of Garbled OCR Data with Basic Language Technology Tools and Means. LREC 2016
- [c11] Kimmo Kettunen, Eetu Mäkelä, Juha Kuokkala, Teemu Ruokolainen, Jyrki Niemi:
Modern Tools for Old Content - in Search of Named Entities in a Finnish OCRed Historical Newspaper Collection 1771-1910. LWDA 2016: 124-135
- [i2] Kimmo Kettunen, Eetu Mäkelä, Teemu Ruokolainen, Juha Kuokkala, Laura Löfberg:
Old Content and Modern Tools - Searching Named Entities in a Finnish OCRed Historical Newspaper Collection 1771-1910. CoRR abs/1611.02839 (2016)
- [i1] Kimmo Kettunen, Tuula Pääkkönen:
How to do lexical quality estimation of a large OCRed historical Finnish newspaper collection with scarce resources. CoRR abs/1611.05239 (2016)
- 2015
- [c10] Kimmo Kettunen:
Keep, Change or Delete? Setting up a Low Resource OCR Post-correction Framework for a Digitized Old Finnish Newspaper Collection. IRCDL 2015: 95-103
- 2014
- [j7] Kimmo Kettunen:
Can Type-Token Ratio be Used to Show Morphological Complexity of Languages? J. Quant. Linguistics 21(3): 223-245 (2014)
- 2012
- [c9] Kimmo Kettunen:
Managing Word Form Variation of Text Retrieval in Practice - why Five Character Truncation Takes it all? Baltic HLT 2012: 111-119
- [c8] Kimmo Kettunen, Paavo Arvola:
Generating Variant Keyword Forms for a Morphologically Complex Language Leads to Successful Information Retrieval with Finnish. IRFC 2012: 113-126
- 2011
- [c7] Jiaul H. Paik, Kimmo Kettunen, Dipasree Pal, Kalervo Järvelin:
Frequent Case Generation in Ad Hoc Retrieval of Three Indian Languages - Bengali, Gujarati and Marathi. FIRE 2011: 38-50
2000 – 2009
- 2009
- [j6] Eija Airio, Kimmo Kettunen:
Does dictionary based bilingual retrieval work in a non-normalized index? Inf. Process. Manag. 45(6): 703-713 (2009)
- [j5] Kimmo Kettunen:
Reductive and generative approaches to management of morphological variation of keywords in monolingual information retrieval: An overview. J. Documentation 65(2): 267-290 (2009)
- [c6] Kimmo Kettunen:
Choosing the Best MT Programs for CLIR Purposes - Can MT Metrics Be Helpful? ECIR 2009: 706-712
- 2008
- [j4] Markus Sadeniemi, Kimmo Kettunen, Tiina Lindh-Knuutila, Timo Honkela:
Complexity of European Union Languages: A comparative approach. J. Quant. Linguistics 15(2): 185-211 (2008)
- [c5] Kimmo Kettunen:
Automatic Generation of Frequent Case Forms of Query Keywords in Text Retrieval. GoTAL 2008: 222-236
- 2007
- [b1] Kimmo Kettunen:
Reductive and Generative Approaches to Morphological Variation of Keywords in Monolingual Information Retrieval. University of Tampere, Finland, 2007
- [j3] Kimmo Kettunen, Eija Airio, Kalervo Järvelin:
Restricted inflectional form generation in management of morphological keyword variation. Inf. Retr. 10(4-5): 415-444 (2007)
- [c4] Kimmo Kettunen:
Managing Keyword Variation with Frequency Based Generation of Word Forms in IR. NODALIDA 2007: 318-323
- [c3] Kimmo Kettunen:
Management of keyword variation with frequency based generation of word forms in IR. SIGIR 2007: 691-692
- 2006
- [j2] Kimmo Kettunen:
Developing an automatic linguistic truncation operator for best-match retrieval of Finnish in inflected word form text database indexes. J. Inf. Sci. 32(5): 465-479 (2006)
- [c2] Kimmo Kettunen, Markus Sadeniemi, Tiina Lindh-Knuutila, Timo Honkela:
Analysis of EU Languages Through Text Compression. FinTAL 2006: 99-109
- [c1] Kimmo Kettunen, Eija Airio:
Is a Morphologically Complex Language Really that Complex in Full-Text Retrieval? FinTAL 2006: 411-422
- 2005
- [j1] Kimmo Kettunen, Tuomas Kunttu, Kalervo Järvelin:
To stem or lemmatize a highly inflectional language in a probabilistic IR environment? J. Documentation 61(4): 476-496 (2005)
last updated on 2024-10-07 21:23 CEST by the dblp team
all metadata released as open data under CC0 1.0 license