Classification of US Supreme Court Cases using BERT-Based Techniques

Vatsal, Shubham; Meyers, Adam; Ortega, John E.

Computer Science > Computation and Language

arXiv:2304.08649 (cs)

[Submitted on 17 Apr 2023 (v1), last revised 24 Jul 2023 (this version, v3)]

Title:Classification of US Supreme Court Cases using BERT-Based Techniques

Authors:Shubham Vatsal, Adam Meyers, John E. Ortega

View PDF

Abstract:Models based on bidirectional encoder representations from transformers (BERT) produce state of the art (SOTA) results on many natural language processing (NLP) tasks such as named entity recognition (NER), part-of-speech (POS) tagging etc. An interesting phenomenon occurs when classifying long documents such as those from the US supreme court where BERT-based models can be considered difficult to use on a first-pass or out-of-the-box basis. In this paper, we experiment with several BERT-based classification techniques for US supreme court decisions or supreme court database (SCDB) and compare them with the previous SOTA results. We then compare our results specifically with SOTA models for long documents. We compare our results for two classification tasks: (1) a broad classification task with 15 categories and (2) a fine-grained classification task with 279 categories. Our best result produces an accuracy of 80\% on the 15 broad categories and 60\% on the fine-grained 279 categories which marks an improvement of 8\% and 28\% respectively from previously reported SOTA results.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2304.08649 [cs.CL]
	(or arXiv:2304.08649v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2304.08649

Submission history

From: Shubham Vatsal [view email]
[v1] Mon, 17 Apr 2023 22:53:54 UTC (214 KB)
[v2] Tue, 16 May 2023 19:55:51 UTC (214 KB)
[v3] Mon, 24 Jul 2023 15:33:25 UTC (200 KB)

Computer Science > Computation and Language

Title:Classification of US Supreme Court Cases using BERT-Based Techniques

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Classification of US Supreme Court Cases using BERT-Based Techniques

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators