Text2Time: Transformer-based Article Time Period Prediction

Gunasekaran, Karthick Prasad; Babrich, B Chase; Shirodkar, Saurabh; Hwang, Hee

doi:10.13140/RG.2.2.29195.36641

arXiv:2304.10859 (cs)

[Submitted on 21 Apr 2023 (v1), last revised 24 Apr 2023 (this version, v2)]

Abstract: The task of predicting the publication period of text documents, such as news articles, is an important but less studied problem in the field of natural language processing. Predicting the year of a news article can be useful in various contexts, such as historical research, sentiment analysis, and media monitoring. In this work, we investigate the problem of predicting the publication period of a text document, specifically a news article, based on its textual content. To do so, we created our own extensive labeled dataset of over 350,000 news articles published by The New York Times over six decades. In our approach, we use a pretrained BERT model fine-tuned for the task of text classification, specifically for time period prediction. This model exceeds our expectations and provides some very impressive results in terms of accurately classifying news articles into their respective publication decades. The results beat the performance of the baseline model for this relatively unexplored task of time prediction from text.
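The abstract frames time period prediction as text classification over publication decades. As a rough illustration of that labeling step only, the sketch below maps a publication year to a 0-based decade class index. The function names, the `first_decade_start` parameter, and the example value 1960 are assumptions made for illustration; the abstract states only that the dataset spans six decades and does not say which ones or how labels were encoded.

```python
def decade_label(year: int, first_decade_start: int, num_decades: int = 6) -> int:
    """Map a publication year to a 0-based decade class index.

    first_decade_start is a hypothetical parameter: the first year of the
    earliest decade covered by the dataset (must be a multiple of 10).
    """
    if first_decade_start % 10 != 0:
        raise ValueError("first_decade_start must be a multiple of 10")
    index = (year - first_decade_start) // 10
    if not 0 <= index < num_decades:
        raise ValueError(f"year {year} is outside the covered decades")
    return index


def decade_name(index: int, first_decade_start: int) -> str:
    """Turn a class index back into a readable decade name such as '1980s'."""
    return f"{first_decade_start + 10 * index}s"


if __name__ == "__main__":
    # 1960 is an illustrative start year, not taken from the paper.
    label = decade_label(1987, first_decade_start=1960)
    print(label, decade_name(label, first_decade_start=1960))
```

With six classes defined this way, each article's label is a single integer, which is the form a sequence classification head typically expects as a training target.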

Comments:	8 Pages
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2304.10859 [cs.CL]
	(or arXiv:2304.10859v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2304.10859
Related DOI:	https://doi.org/10.13140/RG.2.2.29195.36641

Submission history

From: Karthick Prasad Gunasekaran
[v1] Fri, 21 Apr 2023 10:05:03 UTC (3,908 KB)
[v2] Mon, 24 Apr 2023 03:56:03 UTC (3,907 KB)
