Artificial Intelligence and Law 30 (1):21-57 (2022)

The paper compares clustering approaches applied to legal judgments from the Special Civil Court located at the Federal University of Santa Catarina. The subject is Consumer Law, specifically cases in which consumers claim moral and material compensation from airlines for service failures. To identify patterns in the dataset, we apply four clustering algorithms: Hierarchical, Lingo, K-means, and Affinity Propagation. We evaluate the results on four criteria: entropy and purity; the algorithm's ability to provide labels; a legal expert’s evaluation; and experimental complexity. The results indicate that Hierarchical Clustering is the most advantageous approach: it has the best entropy and purity values, its clusters are the least difficult for the expert to analyze, and it has the lowest experimental complexity. The main contribution of the paper is to show the advantages and disadvantages of each approach, especially for identifying labels in unstructured and non-indexed legal texts.
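The abstract does not define entropy and purity, so the following is only a minimal sketch of the definitions commonly used for cluster evaluation, not the paper's own implementation. It assumes each judgment has a reference class assigned by an expert; the function names, the base-2 logarithm, and the sample labels ("delay", "baggage") are hypothetical choices made for illustration.

```python
from collections import Counter
from math import log2


def _class_counts(clusters, classes):
    """Group reference classes by cluster id and count them (helper for this sketch)."""
    by_cluster = {}
    for cluster_id, ref_class in zip(clusters, classes):
        by_cluster.setdefault(cluster_id, Counter())[ref_class] += 1
    return by_cluster


def purity(clusters, classes):
    """Fraction of items that fall in the majority class of their cluster (higher is better)."""
    counts = _class_counts(clusters, classes)
    return sum(max(c.values()) for c in counts.values()) / len(clusters)


def entropy(clusters, classes):
    """Size-weighted average of per-cluster class entropy in bits (lower is better)."""
    counts = _class_counts(clusters, classes)
    n = len(clusters)
    total = 0.0
    for c in counts.values():
        size = sum(c.values())
        h = -sum((v / size) * log2(v / size) for v in c.values())
        total += (size / n) * h
    return total


# Hypothetical toy data: six judgments, two clusters, two reference classes.
cluster_ids = [0, 0, 0, 1, 1, 1]
ref_classes = ["delay", "delay", "baggage", "baggage", "baggage", "baggage"]

print(round(purity(cluster_ids, ref_classes), 4))
print(round(entropy(cluster_ids, ref_classes), 4))
```

Under these definitions, a clustering in which every cluster contains a single reference class has purity 1 and entropy 0, which is why the two measures are usually reported together.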
DOI 10.1007/s10506-021-09287-3